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PKEFACE. 


The following elementary treatment of the Method of Least 
Squares has grown out of my attempts to so prigsent the sub- 
ject to students of physics, astronomy, and engineering, that 
a working knowledge based upon an appreciation of its prin- 
ciples might be acquired with a moderate expenditure of time 
and labor. 

Conceiving that the ultimate warrant for the legitimacy of 
the method itself is to be found in the agreement between the 
observed distribution of residuals and the distribution repre- 
sented by the error curve, I have not scrupled to abandon 
altogether the analytical demonstrations of the equation of 
this curve and "^o present it as an empirical formula, represent- 
ing the generalized experience of observers. The evidence in 
support of a formula of this kind is necessarily cumulative, 
and the few curves which are presented in illustration of the 
law of error are to be considered as samples of the kind of 
evidence which exists in great abundance. By abandoning 
the theoretical demonstrations, tl:^ student is freed from the 
embarrassments which are usually encountered at the thresh-** 
old of the subject, and which in many cases cause it to appear 
as a mathematical puzzle whose analytical difdculties absorb 
the attention of the tyro to the complete exclusion of the pur- 
poses for which the analysis is conducted. 

I have sought to give prominence to the distinction between 
accidental and systematic errors, and to insist upon the limi- 
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tations wMcli result from the difference between these two 
classes of error. To illustrate the principles of the text, I 
have made free use of numerical data and have arranged the 
computations in forms which experience has shown to be 
convenient for the purpose, with a view to their subsequent 
use by the student as models for his own computations. 

In the preparation of these pages, I have consulted many, 
if not most, of the standard treatises upon the subject, but 
my indebtedness for suggestions and methods of treatment 
is principally to 

Fate, Qours d’Astronom de I’J^coIe Pol^Jmgue. 

Oppoizbs, Lihrbuch der BahMinmmg. 

Weight, Tr&idm on tk AdjvMmenI; of OhsmoMons. 


G. C. C. 
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th:e. method of least squaees. 


§ 1. Pkoblem. To determine the coefficient of linear expan- 
on. of a certain bar of metal its length was determined at 
ent temperatures hj comparison with a standard of known 
The data furnished by the measures are ( KoUrmsch, 
der Fhysik) : 


Temperature. 

20° C. 

40 

60 

60 


Observed X<ength« 
mm. 

1000.22 

1000.66 

1000.90 

" 1001.06 


^ "N 

i xs requir6d4iQ__deterniine from these observations the amount 
' tlxe expansion of the bar per degree Centigrade. 

1^ c denote ‘5ie required expansion, and Iq the length of the 
ir ^?vlaen its temperature is 0° C., its length, Z, at any other 
ni^pexature, t, may be represented by the equation 

Iq t • C ^ I 


means of this equation the four observations recorded 
xov'e a.re transformed into the following obsei'vation equations: 


(1) + 20 0 = 1000.22'^ 

(2) Zo + *10 c = 1000.65 

(3) + 50 c = 1000.90 

(4) 7o -f 00 c = 1001.05 J 




( 1 ) 


Any" two of these eq\iations are sufficient to determine the 
dues of Iq and c, but the values derived from different pairs 
; eq^ixations will be different. Thus we may hud from 
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Equations. 

^0 

o 



mm. 

(1) and (2> 

999.79 

+ 0.0215 

(1) and (4) 

999.80 

.0208 

(2) and (3) 

999.65 

.0250 

(3) and (4) 

1000.15 

.0150 

etc. 

etc. 

etc. 


We are here presented with a problem of constant recur- 
rence in the investigations and applications of physical science. 
In order to determine the values of certain quantities with a 
high degree of precision more measures or observations are 
made than are absolutely necessary, and these observations 
prove to be inconsistent among themselves, so that the result- 
ing values of the unknown quantities depend upon the manner 
in which the data are combined. It is evident that all of the 
values above found for Iq and c cannot be correct, and it is 
doubtful if any absolutely correct value can be derived from 
the data ; but it is also apparent that the observations are not 
worthless and that any of the values above derived may be 
considered as approximations, more or less close, to the true 
values of the required quantities. ^ 

If we assume that the relation between the length of a bar 
and its temperature can be* expressed by an equation of the 
form employed above, we must suppose that the discordances 
, in the results are due to errors in the observations, and the 
problem then becomes : 

To find from the observed data a set of results which shall 
be affected as little as possible by the errors of Che data, or in 
more technical language, to find the most probable values of 
the unknown quantities. 

We may establish in advance of any formal investigation of 
this problem certain principles to which its solution must con- 
form. Thus, 

(A.) The adopted values of the quantities which are to be 
determined must be based upon all the data available. Only 
in exceptional cases, which will be considered hereafter, is it 
proper to omit or reject any observation or any known rela- 
tion among the quantities. 
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(B) Tlie adopted values must satisfy the observation equa- 
tions as nearly as possible. 


§ 2. Errors and Residuals. The expression e^ror of an 
observation has been, freely used in the j)receding section, but 
it should be recognized that the amount of this error can 
rarely, if ever, be known, since this would imply an* exact 
knowledge of the unknown quantities. Wo may, however, 
obtain approximate values of these errors from tho adopted 
values of the quantities which were to bo determinoeb Thus, 
if the values Iq = 999.79, c = -b 0.0215 be substituted in equa- 
tions (1), these become 

(1) 1000.22 = 1000.22 (3) 1000.87 » 1000.90 

(2) 1000.65 == 1000.65 (4) 1001.08 =« 100 1.05 

The difference between the first and second members of any 
one of thes.e__equations is called the residnal of that equation, 
and is approximately the error of the corresponding observa- 
tion. The residuals, v, which correspond to the several values 
of Zq and c derived in § 1 are given below in tabular form. 


Zq == OOS.I® 900.80 

c = + 0.0216 +0.0208 


V = 0.00 0.00 

.00 + .02 

— .03 + .00 

+ .03 ,00 


999.05 


+ 0.0250 

+ 0.07 
.00 
.00 

— .10 


1000.15 
4 0.0150 

— 0.23 

— .10 
.00 
.00 


We may thus, for any assumed values of tho unknown quan- 
tities, find a corresponding set of* residuals, anti tho smaller 
these residuals are, the closer is the probable iipproxiination of 
the assumed, to the true values. Principle (H). 

This statement, however, requires an important qualification 
to which we now proceed. 

The errors with which any given series of observations is 
affected may bo tlivided into two classes : 

Accid^eal Errors, or those whoso law oE reeurrenc.o i.s such 
that m the long run they are aa ..Eton p<).sitiva aa uegutivo and 
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whose effect upon the mean of a great number of observations 
therefore differs but little from zero ; and 

S^steynatic JSrrorS) or those which in. the given, series of 
observations do not thus tend to be eliminated from the mean. 
In the observations considered in § 1, an error of judgment by 
which the observer in a given case read the thermometer 0®.l 
too high would probably be an accidental error, since it may 
be presumed that in the long run he would read it as often too 
low as too high, but if through a fixed habit of observing, the 
thermometer were ^always read too high this would be a sys- 
tematic error, and the number of observations might be indefi- 
nitely increased without in the least diminishing its effect. 

If the standard of length with which the bar was compared 
were an erroneous standard (e.f/. 0.01 mm. too long), all of the 
observations would be affected with a systematic error due to 
this source, and the residuals would furnish no trace of this 
error, since they show only discordances among the observar 
tions, and not errors affecting all alike. The smallness of the 
residuals in any case, therefore, furnishes no guaranty that the 
observations and the results derived from them have not been 
vitiated by systematic errors. 

The presence of errors of fliis class constitutes the greatest 
obstacle to the accurate determination of any set of quantities 
whose values are sought, and the ingenuity and skill of the 
observer or experimenter cannot be better employed than in 
avoiding or overcoming the effect of such errors. It therefore 
deserves especial, notice that systematic errors can often be 
transformed into accidental errors by varying the methods of 
observation or the conditions under which the observations are 
made. Thus the possible systematic error of judgm'ent in 
reading a thermometer, to which allusion was made above, may 
be transformed into an accidental error if several different per- 
sons take part in the observations, since it is hardly probable 
that they will all have a common, persistent error of judgment. 
The error due to using an erroneous standard of length may 
be changed into accidental error by employing a number of 
different standards, since, it is not probable that these, con- 
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structed afc different times and by different makers, will all 
have a common error of length. Considerations of this char- 
acter serve to illustrate the great practical importance of vary- 
ing the methods of determining any q^uantity whose value is 
desired with great precision. Multiplying observations by the 
same method and under similar circumstances serves only to 
diminish the effect of accidental errors and is useless beyond a 
certain limit, while varying the methods and the circumstances 
under which observations are made tends to eliminate errors 
of both kinds. 

The principles here considered find their appropriate appli- 
cation in the selection of the methods by which any given set 
of unknown quantities is to be determined, but after the obser- 
vations have' been made, since they can, in general, furnish but 
little, if' -any, information in regard to their own systematic 
errors, these must be neglected and the reduction and discus- 
sion of the'observations directed toward eliminating the effect 
of the accidental errors. 

*■ * m 

§ 3. The Distribution of flesiduals. Gauss, a German mathe- 
matician, has shown by a course of analysis based upon the 
theory of probabilities, that in any given series comprising a 
very large number of observations affected with accidental 
errors, the number of errors of a given magnitude, cc, is a func- 
tion of that .magnitude. Thus, if x' and a;'' denote any two 
errors, and and y” the number of observations having the 
errors a?' and x'' respectively, then 

y* i : i f{x*) I 

The analytical expression for jr(a;)<^ obtained by Gauss is 

/(a:) = -7= (2) 

V TT 

where e = base of the hTaperian system of logarithms, 

TT = ratio of the circumference to the diameter of a circle, 
h = ix number whose value must be derived for each series 
of observations, but is constant for all the obser- 
vations of that series*. ■ 



6 


THE METHOD OP LEAST SQUARES. 


The same expression for /(x) lias been derived by other 
mathematicians through different courses of analysis, but 
against all of these investigations objections of a theoretical 
character have been urged. Experience, however, shows that 
the actual distribution of residuals does follow this law, not 
with absolute accuracy, but to a remarkable degree of approxi- 
mation. An excellent illustration of this distribution in the 
case of a comparatively small number of observations is 
afforded by a sieries of 60 determinations of the velocity of 
light made at Washington, in the year 1882.* By means of a 
revolving mirror the time required for the passage of a ray of 
sunlight from one terrestrial point to another was measured- 
The mean of the 66 determinations of this time interval, was 
24.82T millionths of a second. By subtracting this mean fronx 
each single determination a series of residuals will be obtained, 
and the number of residuals whose magnitude equals 1, 2, 3, 
etc. units may then be counted. In this way a fair approxi- 
mation to the distribution of residuals represented by G-auss’s 
law of error will be found; but as this law purports to repre- 
sent the average distribution of a great number of errors, we 
shall obtain a better comparison between it and the actual dis- 
tribution by the following de'vice, to which we resort in order 
to increase the number of available residuals : 

Let it be assumed that in any given set of observations the 
number of residuals of magnitude x is proportional to the num- 
ber of residuals occurring between the limits x — a ancl 
where a is a quantity which in strictness ought to be an infini- 
tesimal, but 'which may be made a small finite quantity withoxit 
appreciable error. In the present case we adopt as the unit in. 
which the residuals are to be expressed, the thousand-millionth 
part of a second (O'.OOOOOOOOl), and put a equal to two such 
units. Thus, from a series of 66 observations are derived the 
following numbers which, represent the distribution of resid- 
uals which might be expected to occur in a much longer series. 


^ Velocity of Ligkt In Air and Refracting Media. Bureau of Havi- 
gation, Navy Department, 1885, p. 187, 
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ItefiiiluaLi /a 

thaa — 13.5 2 0.8 

Equal to —13.5 O 0.0 

— 12.5 . 2 0.8 

— 11.5 2 0.8 

— 10.5 2 0.8 

- 9.5 3 1.2 

— 8.5 2 0.8 

— 7.5 4 1.6 

— 6.5 6 2.3 

— 5.5 8 3.1 

— 4.5 12 4.7 

— 3.5 16 6.8 

— 2.6 18 7.0 

— 1.5/ 21 8.2 

— 0,6 23 8.9 

> 


Kesidiial. No. % 

Greater than + 13.5 0 0.0 

Equal to +13.5 .2 0.8 

+ 13.5 2 0.8 

+ 11.5 3 1.2 

+ 10.5 6 2.3 

+ 9.5 6 1.9 

+ 8.0 6 2.3 

+ 7.5 7 2.7 

+ 6.5 8 3.1 

+ 5.5 10 3.9 

+ 4.6 12 4.7 

+ 3.5 15 6.8 

+ 2.5 17 6.6 

+ 1.5 21 8.2 

+ 0.5 23 8.9 


The coUumiL headed % represents the number of residuals 
differing not more than half a unit from the magnitude given 
in the first column; expressed as a percentage of the whole 
number of residuals. Fig. A furnishes a graphical representa- 
tion of this distribution, each percentage in the above table 
being represented by a point whose abscissa is th^ magnitude 
of the residual and whose ordinate is the percentage itself. 
The curve whose^quation is 


y = ft = 0.158 

“s/tt 

is shown in the same figure, and a simple inspection of the 
curve shows that its ordinates represent very approximately 
the percentage of residuals of each magnitude. The coeffi- 
cient h appears multiplied by the factor 100 in order that the 
ordinates may.be represented as pefdbntages. 

Figs. B, C, D, represent the distribution of residuals in three 
other series of observations of different kinds, made at differ- 
ent places, by different observers, but all following the same 
law. The unit in which the residuals axe expressed, unit of aj, 
is stated with each figure, axxd the unit of y is in every case 
one per cent of the whole number of residuals. 

The equations of the several curyes shown in the figures are 
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almost identical, but the feature to wliicli the stxideiit’s atteU' 
tion is called is that the alg'obraic form of the eq[uatioa is ini 
each case 

V = — 

and not that U has approximately the same value in each^ 
curve. The numerical value of h depends upon the xinitl 
adopted for a;, and these units having been chosen with refers 
ence to a convenient graphical representation of the residuals, : 
the agreement in the- several values of U must be regarded as 
purely artihcial. 

The series of observations represented in Fig. D is known! 
to be affected with small systematic errors, . and it will bi 
noted that the distribution of the residuals is more irregularj 
in this case than in any of the others. In each of the series 
represented in Figs. A and C, there are two residuals whosb 
magnitudes are too great to be represented in the figures ; and! 
it is quite generally found that the actual number of very large 
residuals is slightly greater than the number given by the 
error curve. The illustrations here given are typical cases, 
and' may serve to exemplify the statement made at the begin-i 
ning of this section*, that the actual distribution of residuals is 
found to follow G-auss's law of error, and in the following sec-[ 
tions this lawwill be assumed as experimentally demonstrated,! 
and from it will be derived the method of combining and dis-j 
cussing observations. The student will find it an instructive] 
exercise to treat in a manner similar to that pursued above 
any series of observations to which he may have access, par- 
ticularly his own observations, and thus lend additional weight, 
to the experimental evidence which is hero presented for his; 
consideration. ’ . 


I 


§ 4. The Error Curve. From the manner in which the ordi- 
nates of the points .plotted in Figs. A, B, C, and D were derived, 
it will be apparent that these ordinates represent the nmnber 
of residuals falling within certain chosen limits of error. Thus 
in Fig. A, 8.0 per cent of all tho residuals lie between the 
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limits 0 and +1, 8.3 per cent between +1 and etc., tb© 
interval within which the residuals are enumerated being in 
every case one unit. It is also evident that the number of 
residuals falling within any other interval, A a;, will depend 
U]3on the magnitude of this interval as well as upon the ordi- 
nate corresponding to it, and if A a? is taken sufficiently small 
the number of residuals will be proportional to the product 
2/ • A 05 . Geometrically considered, this product is the area in- 
cluded .between the axis of as, the curve, and the two ordinates 
drawn through the extremities of A as, and the number of resid- 
uals falling within the limits of A a? is therefore proportional 
to thisL.a.rea«r'' We may, if we choose, make A a? an infinitesimal, 
and the area y • A a5 and the corresponding number of residuals 
will then become indefinitely small, but by taking the sum of 
all the infinitesimal areas included between the limits as = a 
and a? = where a and b have any values whatever, we obtain 
the area of that part of the curve included between ordinates 
drawn at these limits. By a similar process of summation we 
obtain the number of residuals lying between a and 6, and the 
number of residuals thus found must be proportional to the 
area, since this proportionality is true in every infinitesimal 
element include^ in the area. 

In the following table, the function, A, represents the area 
of that part of the error curve included between ordinates 
whose abscissas are 0 and x, the argument of the table being 
the values of x for the particular error curve in whose equa- 
tion 7i. = 1 ; but the area included between 0 and x in the curve 
corresponding to any other value of h may be found from the 
same table,' by using as the argument hx instead of x. 

The area of that part of the curve lying between the limits 
a and 6 is represented by 

A = C ydx = C dx (3) 

•J a •TT*^ “ 

Tet the variable in this expression be changed by putting 
hx = t, and the expression becomes 
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These expressions for become identical if 7i = 1 ; hence, if 
the value of A be computed from the second integral for ^ = 1 , 
and tabulated, vre may find from this table the value of A cor- 
responding to any other value of h by changing the limits a, 

& into ha and hb. 

A remarkable property of the curve, which will be of use 
hereafter, may be readily obtained from the expression here 
found for Al If we make a = 0 and 6 = oo, the limits of the 
integral become 0 and oo for all values of h, hence the area of 
that part of the ^curve included between a; = 0 and cc = oo is 
the same for all values of 7i, i.e., for every series of observations. 


Table oe Areas oe the Error Curve betweejt the- 

Limits 0 ajsti) hx. 


hx 


hx 

^ ruff . 

hx 

A niff .' 

0.0 

0.000 

• 1.0 

0.421 

2.0 

0.49766 


66 


19 


85 

0.1 

.066 

1.1 

.440 

2.1 

.49851 


55 


15 


66 

0.2 

.111 

1,2 

.455 

2.2 

.49907 


63 


12 


36 

0.3 

.164 

1.3 

.467 

2.3 

.49943 


60 


9 


23 

0.4: 

.214 

1.4 

.476 

2.4 

,49966 


46 


7 


14 

0,5 

.260 

1,5 

.483 

2.5 

,49980 


42 


5 


9 

0.6 

.302 

1,6 

.488 

2.6 

.40989 


37 


4 


4 

0.7 

.339 - 

, 1.7 

.492 

2.7 

-49993 


32 1 


3 


3 

0.8 

.371 1 

1.8 

.496 

2.8 

.49996 


27 


1 


2 


.398 

1.9, 

.496 

2.9 

.49998 


23 


2 


1 

1.0 

.421 

2.0 

.498 

3.0 

.49099 





00 

.50000 


If in any scries of observations denote the number of 
residuals whose magnitudes are included between the limits a 
and 6,. n the whole number of residuals in the series, aud 
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Ai the values of A obtained from the table with the arguments 
Jia, 7ibf then 

qp A) 

since the ratio of n]* to n is eq^ual to the ratio of the area of 
that part of the error curve which lies between the limits a 
and bj to the area of the whole curve, and this latter area is 
seen from the table to be always unity. The — sign in this 
eq_uation is tbv be used when a and b have like signs, and the + 
when they have unlike signs. If the percentage of residuals 
between the limits a and 6 is required, it may be found by 
substituting IQO in place of n as the coef&cient of (Aj,'::fAa)* 
Thus from iFig. A we j&nd for the series of observations there 
represented, = 0.025 and 7i = 0.158. 

To find the distribution of residuals between the limits 
— oo • • • — 5, 5 • • • “”*■ 2, — 2 . . . -|- Ij 1 . . . 4, -f" 4: • • • 00 , 
we proceed as follows : 


so 

4 

hso 




F«r cent. 

Obs. 





*66 


— oo 

— oo 

0.500 



/ 


— 5 

-0.790 , 

.368 

0.132 

9 

13.2 

9 

— 2 

--o-ste 

.172 

.196 

13 

19.6 

11 




.260 

17 

26.0 

17 

"i” 1 

“j- 0. 153 

.088 





+ 4 

+ 0.632 

.314 

.226 

16 

22.6 

13 


■ 


.186 

■ 12 

18.6 

16 

+ QO 

+ oo 

.600 









66 

100.0 

66 


The numbers in the column ''Per cent” may be compared 
with the percentages given on page 7. The column "Obs.” 
gives the actual number of residui^is which occur in the given 
series between the limits here considered, and these numbers 
should be compared with the column 

By the use of this table, the distribution of residuals in any 
series of observations for which the value of 7i is known may 
be compared with the theoretical distribution much more 
readily than by plotting a curve, and the student should in 
this way examine several series of observations. The method 
of determining 7i for any given series is contained in § 12- 
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§ 5. The Principle of least Sq[uares. The riuaiitity h which 
appears in the equation of the error curve deserves especial 
attention. .If in the equation 

y -= Q—h^X'* 

X be put equal to zero, the resulting value of y is This is 

V-TT 

the maximum ordinate of the curve, and the value of this 
maximum ordinate varies directly as 7i. If those parts of the 
curve remote from the axis of y be cousidered, it vp-ill be fouud 
that the larger is h, the smaller are the values of yj since 'wheu 
a; is a large quantity diminishes much more rapicOy for 

increasing values of h than h itself increases. These relations 
bet'ween y and h correspond exactly to the criteria by which 
we estimate the precision of observations. If we compare two 
series of observations, I. and II., and find that in series I. the 
small errors are relatively more numerous {large values of y 
for small os’s), aud the large errors less numerous {small values 
of y for large a?'s), than in. series II., we shall without hesita- 
tion call the observations of series I. more precise or accurate 
than those of series II. and if required to assign definite 
meanings to the terms ^fmore precise’^ and ^^less precise,” we 
shall find difficulty in defining them in any other manner than 
by reference to the magnitude of the i*esiduals. We therefore 
adopt as the measiire of precision of any series of observations 
the value of h in the equation of its error curve ; and having 
thus defined the term ‘^precision,” we are able to state two 
principles which are of general application in the discussion 
of observations. 

Let the data furnished by each observation be exx>Tessed in 
the form of an observation equation (Equations 1, ^ 1), then : 
the best attainable values of the unknown quantities are those 
which, 

(1) Distribute the residuals in accordance with the law of 
error, y = •—p. and which, 
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(2) jMake the value of h in the equation of the resulting 
error curve a maxinuim. 


The first of these principles is indeed involved in the sec- 
ond, since if the residuals are not distributed in accordance 
with the law 


2 / = 




there can he no value of h to he made a maximum. It is, how- 
ever, advantageous to state (1) as a separate principle, since it 
affords a test of the presence of systematic errors in the data^ 
which, though far from being a perfect criterion, is often con- 
venient and is sometimes the only test available. 

To justify the statement of (2), we resort to the following 
considerations : In accordance with A, § 1, we suppose that aU 
of the data available is contained in the observation equations, 
and, B, § 1, we seek to satisfy aU of these equations as nearly 
as possible. If the observations are free fr6m systematic 
error, a supposition which must here be made, since we have 
no means of taking into account the effect of su,6h errors, we 
may obtain by sxibstituting in the observation equations any 
set of values which approximately satisfy them, a correspond- 
ing set of residuals which will be the errors of the observations, 
on the supposition that the substituted values were the true 
values of the unknown quantities. If these residuals are plot- 
ted in an error curve, they will furnish a numerical measure 
of the i^rccision 7i, assigned to the observations by this set of 
values, and out of all possible sets of values of the unknown 
quantities that set which assigns .the maximum precision to 
the observations will be entitled to the greatest degree of con- 
fidence ; for if it were otherwise, we should have no reason for 
preferring a set of values which exactly satisfied all of the 
equations to a set which did not satisfy them. 

It is, of course, true that subsequent observations may fur- 
nish a better determination of the unknowns, and that the 
values thus found will not assign to the earlier observations as 
high a degree of precision as did the erroneous values obtained 



14 


THE METHOD OF LEAST SQTJAEES. 


from those observations alone,’ but this subsequent determina- 
tion is based upon additional evidence, and the problem with 
which we are concerned is not to obtain the best possible val- 
ues of the unknown quantities, but the best values which can 
be derived from the data in our possession. 

Assuming, then, the validity of (2), we proceed to transform 
it into an expression more convenient for practical use, and for 
this purpose we resort to the following property of the error 
curve, which may be approximately verified by actual measure- 
ment from any" of the plotted curves, Figs. A, B, C, B. 

If the error curve be divided - into a great number of parts 
by drawing equidistant ordinates throughout its whole extent, 
and the areas of the several parts into which the curve is thus 
divided be each multiplied by the square of the abscissa of its 

middle point, the sum of all these products will equal 

The analytical expression for the process above described is 

0 7 ) •/**> 

I cc^ydx or I 

9J—93 


Put 7ix = t, and this integral becomes 



6”^ dt 


1 

2h? 


For the method of obtaining the value of the last integral, 
see Newcomb^ s Calculus^ Articles 169, 176. 

The area of each of the parts into which the curve was 
divided is proportional to the number of residuals occurring 
between the limiting ordinates of the part ; thus, let A. denote 
the area of the part,‘*j^ the corresponding number of residuals, 
and n and a the whole number of residuals and the whole area 
of the curve respectively ; then 


A: N" i : am 


but from the table in § 4, a = 1, whence 

A and Ax^ = 
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Since JT denotes the number of ar’s falling within the given, 
infinitesimal part, .-d, of the cux've, is e(iual to the sum 
of the squares of the a^s (residuals) whose niagnitudes fall 
between the limiting ordinates of -d, and taking the sum of all 
the we obtain ^ 

S ^la;2 = i :S aj= 


le., is equal to the mean of the squares of all the residuals. 
It is customary to represent the sum of the squares of the 
residuals by the symbol [['wu], v standing for any residual, and 
the £ 3 denoting the sum of all quantities of the kind written 
within them. 

Comparing this result with, the one obtained above, we have 


n 2/i® 



from which it appears that the relation between h and the 
sum. of the squares of the residuals is such that when h is 
a maximum,* [w] is a minimum, and principle (2) may be 
restated as follows : 

'The most probable 'values of the unTcnoion qti^ntities are those 
which tncthe the sum of the squa7'es of the I'esidicals a Tnini^nuiri, 

Eroni this " principle has been derived the name Method of 
feast jSqua^'es, which is commonly applied to that body of 
principles which treats of the combination and discussion 
of observed data. 

We have arrived at this principle from a consideratiou of 
that class of cases in which the quantity observed is a func- 
tion of two or more unknown quantities whose values are to 
he obtained from the ohservatioifts. GChis obviously includes 
the case of ' a single quantity, cc, whose value is directly meas- 
ured ; and it will be advantageous ■ to apply the principle of 
least squares to this case. The observation equations are here 
of the simplest possible form. 

03 = 

03 = 7n.i 

ca = 
etd. 

where m denotes an observed value of a;. 
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If ccq denote any assumed value of x, tlie residuals obtained 
by substitriting- it in these eq^uations will be 

— '}}}>]_ ■“ 3 J(j 
Vo = Wig CCq 

Vi = 7)Iq — Xq 

•*« «•« 

and ['vv] = (7711 — a?o) ^ -1- (mg — a^o) * 4 * — ajp) ® • • * +• — a^o) ® 

The value of which will make [ij-u] a minimum is found 
from 

^C~il=o= --2(mi--aJo) — 2(m2— ajo) — 2 (m 8 —a 3 o) 2 (m„~a 3 o) 

ctx^ 

but this equation is equivalent to 

mj. 4 “ 4- m3 -( — » 4 - TTin [m] 

n n 

and it thus api>ears that the universal practice of taking the 
arithmetical mean of all the measures of a single quantity as 
the best value of that quantity, is a particular ease under the 
more general method of -least squares. 

§ 6. Weights. It frequently happens that the circumstances 
under which an observation was made lead the observer to dis- 
trust its accuracy, while other causes give him increased confi- 
dence in another observation. Observations which thus differ 
in quality* are said to have different weights, the weight being 
a numerical measure of the quality, and these weights should 
be taken into account in combining the observations- 

Tefc us snpi>ose two series of observations made upon the same 
unknown quantity, in one of which the individual observations 
are of different quality and entitled to different degrees of con- 
fidence, while in the other the observations are all equally good, 
but each of them entitled to less confidence than the poorest 
observation of the first series. By taking the' mean of a num- 
ber of observations of this second series, a more reliable value 
of the unknown quantity may be obtained than any single 
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observation of the series can furnish^ and by properly choosing 
the nnxnber of observations to be iiioLiided in the mean, a value 
entitled to as much confidence as any observation of the first 
series may be found. This number of observations of the sec- 
ond series whose mean is entitled to as much confidence as a 
single observation, of the first series, is called the weight of the 
equivalent observation in the first series j and, obviously, the 
better an observation, the greater is its weight. These weights 
furnish no information about the absolute precision of the 
observations, but express only their relative excellence as com- 
pared with each other ; hence, if jpi, pa, ^ 3 , etc., he the weights 
of any observations, Jgojj 7(p.2> h:p^y etc., where k is any constant, 
will express these weights equally well, since it is the ratios 
of the weights, and not their absolute values, which are of 
importance. 

To exhibit the manner in which" these weights are to he 

employed, let us recur to the data of § 1 , and suppose that 

those observations were made under such conditions that the 

* 

first one has a weight 1 , the second 2 , the third 3 , and the 
fourth 4. In accordance with the definition of weights, this is 
equivalent to supposing a second series of observations of uni- 
form excellenoje, such that the first of the actual observations 
can be replaced by one observation of this series, which must 
of course be numerically the same as the observation which it 
replaces ; the second real observation may be replaced by two 
numerically equal observations of the second series ; the third 
by three, etc. Each of these substituted observations will fur- 
nish an equation precisely like those given in § 1 , and when 
the sum of the squares of the residuals is formed, we shall 
obtain ^ 

The symbol which is adopted as an abbreviation for 

this expression, is equivalent, numerically, to the sum obtained 
by multiplying the square of each actual residual by the weight 
of the corresponding observation and adding the products, and 
it is evident that this [pyu] bears the same relation to the 
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substituted observations that [yy] bore to the actual observa- 
tious ill the case of equal -weights, -which was considered in the 
preceding sectiou. The principle there obtained may there- 
fore be generalized as follows : 

The most probable values of the im'knoion quantities are those 
icJiich make the simi of the weighted squares of the residualsj 
[pryj; C6 minimum. 

Let the student show, as in the preceding section, that when 
this principle is applied to the ease of observations of unequal 
weight made upon a single unknown quantity, it gives as the 
most probable value of that quantity 




~ LP] 


As an. example of the application of weights, we select the 
following observations of the time of ending of the transit of 
Mercury of May 6, 1878, which were observed by different 
observers in the city of Washington. These observers were 
provided with telescopes of different sizes and magnifying 
powers, and differed among themselves in point of experience 
and skill, so that their observed times of last contact are not 
entitled to equal confidence. The weights assigned to the sev- 
eral observations represent the judgment of the computer with 
respect to their relative excellence. (Washington Observa- 
tions, 1876, Appendix II., page 55.) 


01>serY«<3' Time. 

P 

jmh 


6^ 38'« 

23’ 

1 

23" 

[pw] =: 318 

37 

55 

0 

0 


38 

10 ■» 

1 

10 

iPl = 16 

38 

2C 

* 3 

78 


38 

21 

2 

42 

[Pl 

38 

18 

2 

38 

38 

10 

3 

57 


38 

21 

2 

42 


38 

15 

2 

30 



Weighted mean of the observations = 6*^ SB"* ID ’.9, 
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§ 7 . Normal Equations. VTo have now to show .how the 
principle of least squares is to be applied in determining the 
values of a set of unknown quantities, and in order to fix 
the ideas as definitely as possible, let it be supposed that there 
are three of these quantities, a;, y, 2, which are connected with 
each one of a set of observed quantities, n, by the relation 

ax by cz ^ n 

where a, 6, and c are numerical coe£S.cients whose values are 
different in the several equations but are supposed known in 
each equation. From a series of more than threp observed 
values of n, the most probable values of x, y, » are to be 
obtained by means of the relation [pw] = a minimum. 

It is not to be presumed that these values* when found will 
exactly satisfy all the equations, and make = 0, but we 

shall find from each equation a residual v, so that strictly the 
observation equations should be written 

a^x .4- 4- Ciz ~ jpi 

ttasc 4- 62?/ 4“ — W2 = v* P2 

a^x 4 " + C32 — ^3 = Pa ' 

etc. etc. etc. 


The symbols pi, po, pa represent the weights assigned to the 
observed values, 710, etc. 

By the ordinary rule for determining a minimum of a func- 
tion of several variables, the condition [_x>V‘o\ = a minimum, 
furnishes the three equations 

ctr ?)w1 ■_ 0 = o 

dx dy ^ , dz 

and in order to obtain these derivatives we form from the 
observation equations 


\_pvv'\ = J 4- -h Cl Vi^2 — 

-h { aaVp2*« 4 - 4 - CaVp3« — Vpa^^S 

d- { «aV2^» 4 - 2>3 Vi?3 y 4 ~ C3 Vpa z — Vp^Waj 
etc, etc: etc. 


2-N 
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The derivativo r-t' this exi>i“o.ssioii with rcspoob to x is 

= 2 + ^iVpTi/ + Ci^Vpi^—VJ^Uil I 

dx 

+ 3 

H- 2 asVK^C£:3 Vj^ya? 4- &3 Vj>3^ + — Vi^yna} 

etc. etc. * etc. = 0 


, Let this expression be expanded, divided by 2 and siinjplifie.d 
by the introduction of [[ ] to denote the sum of all terms like 
those placed within them (all terms standing in the same verti- 
cal column), and it becomes the first of the following group of 


jisTonMAii Eqtjatio2^s, 

[_pacC\x [^2>a&]y -f- []pac]z — = 0 ] 

{^pab'\ X - 4 - S/ 4 - =r 0 > (7) 

l^pac] X -f- 2/ 4- [i>cc] 2f — = 0 J 

The second and third of the.se equations are derived in pre- 
cisely the same manner as the first from the conditions 



These equations are equal in number to the unknown quanti- 
ties, and their solution will in general furnish a determinate 
set of values for these quantities which will he the most prob- 
able values, since the normal equations include all of the data 
furnished by the observations and have been so derived as to 
satisfy the princii>le of least squares. 

Equations (0) furnish a rule which is frequently given for 
the formation of noi*mal equations. To obtain the first normal 
equation, multiply each observation equation by the ptoduct of 
its weight into the coefficient of x which occurs in it, and take 
the sum of all tlie resulting equations. The other normals are 
similarly obtained from the wei'ghts and the coefficients of y, 
z, etc., having due regard to the algebraic signs of the quanti- 
ties in the several multiplications and divisions. This method 
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id occasionally convcTiiciit, l)ub in general tlie uiGbliod of fornv 
iiig uonnul equations given in ^ 1) will be found less laborious. 

The svninietrical luaimer in which the eoelticients of the 

w 

nornral equations are disposed should, be especially noted;^ since 
this considerably diminishes the labor of their formation. The 
first coetficieiit in the second equation is the same as the second 
coefficient in. the first equation, and generally the coefficient 
in the 7^“^ equation is the same as the coefficient in the 
equation. 

Le‘t the student form normal equations from the observation 
equations contained in § 1, assuming that those equations have 
equal weights. 


§ 8. BTon-linear Observation Equations. In all of the pre- 
ceding investigation, it has been, tacitly assumed thart the 
relation of the observed to the unknown quantities can he 
expressed by an equation of the first degree ; but cases in 
which this relation is of a much more complicated character 
are net uncommon, and a method of applying the principle of 
least squares to these cases is required- For the sake of sim- 
plicity, this method will be derived for the ^'ase of two un- 
known quantities, but the process is perfectly general and can 
readily he^extended to any other number of unknowns. 

Let x and 2/ be any two quantities which have not been 
directly measured bub which are connected with an observed 
quantity, m, by the relation 

' /C-^: y") = 


which represents any equation whatever existing between a?, y, 
and m. Let cco R^d yo <lenote a]^proxiinate values of x and y, 
such that - ,4 ,A 


A a; and Ay being the corrections which must he added to a?o and 
yo in order to obtain the most probable values of x and y. We 
may, for the present, suppose that jTq and ?/o mere guesses 
at the values of x and y, and we may test their correctness by 
substituting their numerical values in the equation. 


y) 


m 
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which corresponds to each observed value of m. If every such 
eq[uafcioii were exactly satisfied by these values, we should infer 
that a’o and yo were the most probable values of x and y. It 
cannot be expected that this perfect agreement will ever be 
found in practice, but from each, observation equation' a resid- 
ual, Vj will be found, due partly to the errors of the observa- 
tions and partly to Acc and Ay. ^ ^ 

If in the equation /(a?, y) = m we substitute for x and y 
a5()-|-Aaj, yo+Ay, and develop the expression by TayVoi-^s 
J^onmilcc, remembering that m — /(aro, yo) is the residual n 
found hy substituting numerical values of ajo, y© iR the several 
observation equations, we have 


m 2/o)=^Acc-h^Ay-b 

daJo dyo 


If numerical values of be introduced into this 

equation, it becomes 

a*AcB-f-&*Ay — n.=sO 

Each observation equation may thus be made to furnish a lin- 
ear equation involving A x and A y, and these equations may 
be treated by the method of § J. It must, however, be remem- 
bered that in the above development by Taylofs Wormulcc we 
have retained only the first three terms of an infinite series, 
and if the approximate values x^, 3/0 O'!*® not so nearly the most 
probable values that the squares and higher powers of A a; and 
Ay are inappreciable, the development and the solution based 
upon it are inaccurate. On this account, it is seldom advanta- 
geous to make a least Square solution for the unknown quanti- 
ties until very approximate values of them have been found. 
These values will usually jce obtained from the solution of a 
small number of the observation equations. 

The transformation of the observation equations by the 
introduction of corrections to assumed values of the unknowns 
is often advantageous even when the original equations are of 
the first degree, especially if the original quantities were of 
very difiEerent magnitudes. Thus, in the problem of § 1 , the 
observation equations are of the form 
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711 =/(Z„, c) = 7o -h fO 

in whiijh. c is a very small q^uantifcy "while Zq is approximately 
1000. If we put 

Zq = 1000 ”1“ A Zo c 0 “f" A c 

we have = 1 = t n= m — 1000 

cZZ^ cZc 

and the eq[uations are transforined into 

■ .AZo4-20Ac — 0.22= 0 
A Zq + 40 A c 0.65 = 0 

AZo+oOAc — 0.90 = 0 
AZo+ 60 Ac— 1.05 = 0 

By this transformation the numerical operations involved in 
forming and solving the normal equations are much simplified 
through the substitution of small numbers in the place of 
large ones. 

§ 9. Formation and Solution of the Normal Equations. If 
the dumber of unknown quantities is greater tjran two, and 
especially if the number of observations is large, the numerical 
computation of .ct set of normal equations is a laborious process, 
and one in which errors are almost certain to occur unless 
special precautions are taken to guard against them. The 
method of forming these equations presented in this section 
has been developed with special reference to facilitating the 
numerical operations and obtaining the normals with the least 
expenditure of labor consistent with the requisite accuracy, 
and although some of the processes may seem at first sight 
unnecessary and cumbrous, a little experience in their iise, or 
in their neglect, will convince the student that they are in 
the long riia labor-saving devices. 

Bet each observation equation be written out and arranged 
in tabular form’, as in the following example. In order that 
these equations should furnish a good determination of the* 
unknowns, x, y, ss, it is necessary that the coefficients of these ‘ 
quantities- should present a considerable range in their values 
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iu the several equations. Thus, if all the coofficionts of a? 
were alike, all the coetli<‘ients of y e(|_uul each to each, etc., the 
equations would be absolutely indeteruiiiiate since we should 
have several unknown quantities involved in a single equation 
many times repeated, and if the cocliicients approximate to 
this equality the equations will be approximately indeter- 
minate, and will furnish unreliable values of the unknowns. 
If, therefore, several observations have been made under similar 
conditions, and furnish equations which are nearly identical, < 
these will be nearly equivalent to a repetition of the same 
equation and it- will be permissible to take their mean, having 
regard to their respective weights, and treat it as a single 
observation equation with a weight equal to the sum of the 
weights of the observations. 

Having thus reduced the number of equations as far as 
possible, each equation should be multiplied by the square root 
of its weight as was done, § 7, in obtaining the form of th'e 
normal equations. By this multiplication the weights will be 
completely taken into account and will require no fnrther 
attention. Let the toeighted equations thus obtained be repre- 
sented by 

Ujcc H- bill ^ -f- = 0 

ccocr "4* b^ 4“ 
ctsau -b 6^2/ “H "H 
etc. etc. etc. 

It will usually facilitate the formation of the normals to so ^ 
transform these equations that no number greater than 1 shall 
occur ill any of them. This can always he done by introducing 
new unknown quantities and dividing each equation by some 
constant number, usufilly some power of 10. Thus in the case 
of the two equations 

5»-f7l2/—63=:0 
0.9a; — li)3y + 93=0 

let each equation be divided by 100 and put 

Tfir»=« = 
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the equatious are thus transformed into 

1.000 -h 0.368^ — 0.630 = 0 
0.180 u ~ 1.000 ^0 -h 0.930 = 0 

The solution of these equations will furnish values of u and w 
from ’which a; and y may be found by the relations ' 

. X = 20 u y = 10 

The purpose of this transformation is to simplify the subse- 
quent numerical work by reducing the numbers involved to 
an approximate equality. 

Every coefiScient ■which appears in the normal equations is 
the sum of a series of products of two quantities, thus 

[aa\ = axax -f cufiLz ■+ * 

[aft] = Oxhx + ? 

Q6?2. J = ^-^x “h 2>2^1'2 “h -f- • » • 

* 

These products may be formed by the aid of Grell^s multipli- 
cation tables* ‘supplemented by a table of squares of num- 
' hers for the faa], [&6], etc. In case Orell^s tables are not 
available, the products may be formed by logarithms or much 
more rapidly by following method due to Bessel, Form for 
each equation the sums a -j- &, a -}- b + c, etc., for every pair 
of numbers contained in the equation ; then since 

= + — bb\ 

we have Vabl = i { [(a + bYl ~ Vaal - [&&] } ') 

[6o]=i{C(6 + o)»]-C6&]-[cc]}. I (8) 

etc. etc. etc. I 

. ^ 

The [_aa\j p6], [cc] are coefficients in the normal equations 
and must be computed in any case, and the formation of 
pc], etc., therefore requires for each coefficient only the single 
additional quantity C(u-1“&)^], and presents the 

very greal!; advantage that these quantities can be obtained 

* Crelle, Bechentafeln,. Berlin. These tables give the products of 
all numbers up to 1000 X 1000, and axe of very general utility. 
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from a table of squares, and being all positive numbers no 
attention need be x>aid to the signs after the sums a -f- 6, 6 q- c, 
etc., have been formed. 

iTo method of computation can furnish a guaranty against 
the commission of numerical errors, and it is therefore desir- 
able to test the computation from time to time to ascertain if 
such errors have occurred. To secure such a test or “ check,” 
as it is called, we introduce the following auxiliary quantities, 
one for each observation equation ; 


Si = (Xi — h &i "d" Cj "4" * * * "4“ 

Sg = 0t2 -4" C 2 ”4“ • • • “4* 7^2 

etc. etc. etc. 



and form the quantities [as] [5s] ••• [s?i]. It will appear from, 
the mode in which the coefBLcients of the normal equations 
are formed that 

[eta] “4” [^^] ”4“ [^^] d" * * * "4“ [^^‘^] I 

[a5] + [&5] -b [S]^--. -4- [6n] = [5^] >- (9a) 
etc. etc. ' etc. J 


Check 


The [as], [5s], etc., axe formed in precisely the same manner 
as [a5], [ac], etc., and the cheek relations above given must 
be satisfied by the computed values of these quantities. 

Where only two unknown quantities are involved in the 
normal equations the solution of the equations may be con- 
veniently made by any of the methods of elementary algebra, 
but if the number of unknowns is greater than two, jthe simple 
and elegant- method of successive substitutions proposed- by 
Gauss may be employed with advantage. 

The normal equations in the case of three unknown quanti- 
ties are: 


[aa] X -b [«5] y -b [ac] z -b [aa] == 0 
[a5] X -b [55] y Hb [^<J] 2 -b [5?i] = 0 
[ac ] a? -b [5c ] y d- [cc ] 2 -b [C7i ] = 0 


( 10 ) 


and from the first of these 


X = 


laay 


ac] 

aa] 


z 


[an] 

[<{«] 
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This value of cc substituted in. the second and third equations 
transforms them into 


[66 • 1] y 4- [6o • 1] 3 4- {bn • 1] = 0 ) 

= 0 ) 


[6c • 1] y 4- [cc • 1] 4“ [ca • 1] 


(11) 


in ■which. 

[6& • 1] s [66] 


{act] 

izn 

[aa] 


[«6] [co-l] = [co] 


[67i.l] = [6™]-gy[an] 



“ac] 


“aa] 

1 

“ac] 


”aa] 

[etc] 


[ao] 


N 


>(12) 


[eta] 




These equations constitute a new set of normals, from whicih 
one-unknown quantity has been eliminated. The correctness 
of the numerical work, of this elimination may Jdb tested by a 
continuation of the checks used in forming the original nor- 
mals. We introduce an auxiliary quantity 

{hS’ 1] == [66 . 1] 4“ [6c . l]‘4-[6n • 1] 

and inquire its relation to [as], [6s], etc. If w4 substitute in 
the expression Jor [6s • 1] the values of [66 ♦ 1], [6c - 1], {hn • 1] 
in terms of tlie^original coefficients, having regard to the 
relations 

[aa] 4- [a6] + [ac] + [a?i] = [a^] 

[a6 ] -b [66] 4“ [6<J ] 4- [6n] = [6s ] 


} 


(13) 


we find 

whence 
and similarly, 


[63 . 1] = [6s] - Cct6] - M I [as] - [aa] i 

C6s.l] = [6s]-g|lcaJ: 


N 


> (14) 




Cc.,..l] = M-Mi:aa] 

We may therefore obtain, a complete check upon the accuracy 
of the numerical work involved in the elimination of a?; by 
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forming the quantities [^6s- 1], in the same manner as 

[&6-1], [6c -1], [6/i-l], etc., and conqiaring the actual sums 
of these latter <xuantities with the computed check quantities. 
By a repetition of the process of elimination we obtain 

[cc • 2] 2 ! -4- [cTi . 2]= 0 Chech [cs • 2] 

■where [cc • 2] = [cc • 1] — • 1] 

Cm - 2] = [cr. . 1] [6" - 1] 

Ccs . 2] = [ca . 1] - [6s . 1] 

= [cc • 2] -f- [cn • 2] 

and we are enabled to writeJbLe^ollowmg equxYalents for the 
original normal equations, 'V 




The last of these equations gives the value of « directly, the 
second furnishes y as soon as « is known, and the first gives 
the value of x. Thq whole solution is therefore reduced to 
finding the values of the coefficients and absolute terms in 
these eliminatioji equatio7is, A conveiiieiit arrangement of the 
computation by which these quantities are obtained is given 
in the following examiple, in which the actual Computation is 
exhibited upon one page, and the opposite page contains a 
schedule correspondingly arranged showing the analytical 
equivalent of each number contained in the computation. 
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In making the multiplications of [a6], [ac], [cm], [as], by 

the constant factor the logarithm of this factor is written 

.[ac6] 

on the edge of a slip of paper, and. being held successively 
adjacent to the logarithms of [ad], [«c], sum of 

the two logarithms is taken mentally, the corresponding num- 
ber looked out from a logarithmic table and written in its 
proper place under [dd], [dc], [da], [ds], a subtraction then 
gives the value of [dd • 1], [dc • 1], [da • 1], [da • 1], and a simi- 
lar process is followed for each other derived coefficient. 

§ 10. Example. To illustrate the principles contained in 
the preceding sections, and to exhibit in detail the process of 
deriving the most probable values of several unknown quanti- 
ties which are connected with the observed quantity by a 
rather complicated relation, we select from Vol. iii. Part 1 of 
the Memoirs of the National A.cademy of Sciences^ page 58, tlie 
following series of experiments made with a 10-gauge Oolt 
gun, loaded with uniform charges of four drams of powder and 
ounces of shot, the shot ranging in fineness from No. 10 
up to hTo. 1 Buck, The purjiuse of the experipents was to 
determine the relation existing between the size (fineness) 
of the shot and’Ats average velocity over a range of 30 yards. 
The following table contains the results of the experiments, 
each velocity being the mean result of from three to six dis- 
charges of the gun. The weight of a pellet of No. 10 shot is 
taken as the unit of weight, and the velocities are expressed 
in feet per second. 


Size. 

Weisitt. 

Observed Velocity, 

No. 10 

1 

848 

8 

2 ^ 

920 

6 

4 

966 

3 

8 

989 

BB, 

16 

1000 

FF. 

32 

1017 

No: 1 Buck 

64 

1067 


By plotting these results in a curve with the weight of the 
shot as abscissas, and the observed velocities as ordinates, the 
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experiinentor reiicliod the coaclusion that the relation between 
the weight iraud the velocity V, is expressed by an equation 


of the form 


V 

I 


— sec~^ 


m 


= 0 


in -which Z, m, n, are constants whose values are to be deter- 
rained from the observations. It will be found upon triad that 
Z(, =s 700, mo = 0.28, no = 0.42, in coMection with the' ^observed 
values of V and W will approximately satisfy this equation,^ 
and we therefore adopt these approximate values and proceed 
(§8) to determine the corrections AZ, Am, An, which when, 
added to Zo, m©, no, will furnish the most probable values of 
Z, m, n. 

The several differential coefficients of the observation equa- 
tion, V =f (Z, m, n, TF) are 


dV^V 

dlff Iq 

dV 

cZ?»o 


A 

mo 



dV,^ Z„ 
cZwo M 


log W cot 


V 


in which 3/ denotes the modulus of the common system of 

p" rr 

logarithms, 3/ = 0.43429. In the factor, coty, — is the ratio of 

two numbers, and must be construed as representing a qertain 
arc expressed in parts of the radius: the corresponding arc 

F 

expressed in degrees is 67®. 2958 y. 

The form of the observation equation with which we are 
here concerned is 

— • AZ — ^cot^* Am -4- AlogTP'cot^An — ( F — Zq sec"'^^^ = 0 
Zq WIq Zq 3a . Zq ^ J 

and introducing into this equation the numerical values of 
Zq, ?Mo, Wq, V 4 Wj 3/, we find the following 
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OnSEIiVATIOX jEaUATIOXS. 


(1) 

1.29 AZ — 

T29 A 

-f- 0 A 71 

4- 53 = 

= 0 

p S=S 1.0 

• (2) 

1.36 — 

535 

H- 104 

4- 33 = 

= 0 

1.0 

(3) 

1.41 — 

39G 

-h 154 

4- 24 = 

= 0 

0.8 

(-1) 

1.45 — 

294 

1T2 

4- 28 = 

s 0 

1.0 

(o) 

1.4'8 

219 

+ ITO 

4- 38 « 

= 0 

1.2 

(6) 

1.50 — 

164 

4- 159 

-4- 37 = 

= 0 

1.1 

(7) 

1.52 — 

122 

4- 142 

— 2 = 

= 0 

1.0 


The absolute terms of these equations are residuals obtained 
by substituting in the original equation 

rrrn 

V — I seo""^ = 0 

m 

the assumed values of Iq, mo, and Wq, and the smallness of these 
residuals compared vrith the values of F", shows that' the 
assumed quantities are approximately correct vahies of I, m,n. 
This compu-^d value of V was used instead of the observed V 
in deriving the coejBGLcients of the equations. Tho memoir from 
which our data are taken contains no indication of the weights 
to be assigntKbto the several determinations of F, and in the 
absence of such information they should all be treated as 
equally precise and given the weight 1 ; but for the sake of 
illustration a slightly \Ufferent set of weights indicated above 
by 2^ has been assigned to them, and by multiplying each equa- 
tion by the square root of its weight wo obtain the following 


■Weighted Osseuvation* Equations, 


1.29 A 1 

— 729 Am 

— 0 71 

4- 53 

= 0 

1.36 

— 535 

104 

4“ 32 

= 0 

1.26 

- 352 

4- 137 

4- 21 

« 0 

1.45 

— 294 

4- 172 

4- 28 

« 0 

1,62 

— 239 

4- 185 

4~ 42 

= 0 

1.58 

— 173 

4- 167 

+ 38 

0 

1.52 

— 122 

'4- 142 

— 3 

« 0 
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The coefRoicnt-s and al isolate terms in these equations are of 
very different mnjjnitudes, and to simplify the subsequent 
numerical \vork we divide each equation through by 100 and 





z 



and introduce x, y, z into the equations in place of Al, Am, A?i. 
This step, which is frequently called rendering the equations 
homogeneous, furnishes the following 


■ 

Homogesteous Weighted Obsebvatioh Equation's. 

0.796ic - lOOOy + O.OOO 2 + 0.530 = 0 s = + 0.326 
0.839 - 0.T33 +0.563 + 0.320 = 0 .0.989 

0.777 - 0.482 + 0.741 +0.210 = 0 1.246 

0.895 - 0.403 + 0.931 +0.280 = 0 1.703 

1.000 - 0.327 +1.000 + 0.420 = 0 2.093 

0.975 - 0.236 + 0.903 + 0.380 = 0 2.022 

0.938 — 0.167 + 0.768 — 0.020 = 0 1.519 



EXAJirr-K. 



Tile values of s = a + /> + r -}- >i, whicli are to !»• iisnl as a 
check ill tlio formation of the uoniial fiiiiations, aiv iloriviij 
from these equations. 

The formation of the coetlioieuts of the normal iMiiialioiiH liy 
the use of a table of sqnart'.s, Bmeh mctiioil, is in 


the following tables : 


Sum OP TUB CoKPrioiKm 


SpatiOR 

d’^h 

a+0 


a+ s 



1 

f i » 

0ft 

1 .... 

0.204 

0.796 

1.326 

1.122 

1.000 

!• Ill lf*>» 

0.470 

* 1 * 

0.1174 

li,J ^ *«K _ 

0.330 

0.320 

2 . . . » 

.106 

1.402 

1.139 

1.828 

.171) 

.413 

0.250 

0,«H3 

1.352 

3 . . , , 

.296 

1.518 

0.987 


.239 

.272 

0.764 

0051 

i 

1967 

1 

X • t • * 

.492 


1.175 

2.698 

.528 

.123 

1.31)0 

1.211 

3.0.14 

5.... 

.673 

2.000 

1.420 


.073 

.O'.l.'l 

1.760 

1.420 

3,093 

G. , 

.739 

1.878 

1.355 

2.907 

,067 

.1(4 

l.7rt0 

I.2H3 

9,923 

T... . 

.771 

* 

1.706 

0.918 

2,437 

.001 

.1H7 

1.352 

0,74R 

2,2«7 
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From the sums of the squares contained in the several coh 
tiiiins of this table the coetticients [<i6], [ac], etc., are computed 
at the foot of the columns by the relations 

[ab] - i J [ (a + 5)^] -- C[a^] + [5^]) ], etc. 

The check quantity [as] is compared with 

[act] "f“ [ct&] -j- [^^^] “t" [a^t] 

whose value is written immediately under [as], and which 
must agree with [as] within two ox three units of the last 
decimal place. Every coefficient of the normal equations 
enters into one or more of these sums, which therefore furnish 
a complete test of the accuracy of the work in passing from the 
homogeneous observation equations to the normal equations. 
We now write the 


Foriuax Equations. ' 

‘-f 5.57.6 X - 2.861 y + 4.480 « 4- 1875 = 0 
■ - 2.861a 4- 2.122 y - 1.813 z - 1.200 = 0 
4.480u; — 1.813y 4^ 4.1382 4“ 1348 =0 

It may'lSe^een from an inspectiou of these equations that 
the data upon which they are based will not furnish a good 
determination of the valnes of all the unknowns, for if the 
first equation be divided by — 2 the quotient will be very like 
the second equation, and if it be multiplied by 4‘-| the product 
will be very like the third equation. We proceed, however, 
with the solution by Gciuss^ method, which will furnish the 
best results that the data can he made to yield. 
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Solution of the !N’ormal Equations. 


[aa] 
log [«a] 

[aft] 
log [aft] 

['«••] 

log[«c] 

[on] 
log [on] 

[as] 
log [as] 

[««] 

[66J 

[aa]^ ■* 

[6c] 

[aaj 

[6n] 

i:«‘][o„] 

['"‘J 

pw 


[65 . 1] 

’H 

log [66 • 1] 

[ftc . 1] 

log [ftc • 1] 

[ftn • 1] 

% 

log [ftn ^ 1] 

[6s . 1] 
Oheck sum. 
log [6s • 1] 

loglffl 


[cc] 

[«a]L 

[cn] 

[cs] 

log [fa -1] 

® [66 . 1] 


[cc • 1] 

[ftft*!]*- -* 

[cn • 1] 

P^*^^[ftn.l] 
[ftft-lj*- • 

[cs • 1] 
Check sum. 

^^[6s*n 
[66 •1]'-, ■' 

log [<=" • 2] 

® Ccc.2] 


[cc • 2] 

[cn - 2] 

[cs • 2] 
Check sum. 


log [cc • 2] 

. > • 

log [cn • 23 



ELiMiiq-ATiosT Equations. 


X + 



y 


+ 

+ 


[aa] 

lil 

• 1 ] 




z 


+ 

+ 

+ 


JM 

[aaj 

[fa- 1 ] 

[M-l] 
[cn » 2] 
[cc - 2] 


= 0 
= 0 
=r0 


The course of the computation after the formation of the elimina- 
tion equations is sufficiently indicated upon the opposite page. 
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SOLUTIOI?r OF THE alSORilAL EqTJATIOSTS. 

+ 6.576 —2.861 ++480 + 1.875 +9.070 

0.7464 0.4566 n 0.6513 0.2730 0.9576 

+ 2.122 — 1.813 - 1.200 — 3.752 

9.7102 ri +1.468 —2.299 —0.962 —4.664 



+ 0.664 

+ 0.486 

— 0.238 

+ 0.902 


9.8156 

9.6866 

9.3766ji 

9.9652 

9.9049 


+ 4.188 

+ 3.599 

• + 1.348 

+ 1.607 

+ 8.163 

+ 7.286 



+ 0.539 

— 0.169 

+ 0.867 

9,8710 


+ 0.361 

-0,177 

+ 0.670 



+ 0.178 

+ 0.018 

+ 0.197 

9.0049 


9.2504 

8.2353’ 



+ 0.902 


+ 0.866 


+ 0.X96 


Elimination^ Equations, 


X — 

■ 0.513 y + 0.803 « + 0.336 = 0 
y + 0.743 a — 0.364 = 0 ' 
a +0.101 = 0 

X — 0.030 

y = + 0.489 
a = -0.101 

logx 

log'0.0162 

Al 

h 

1 

8.477111 

8.2095 

-1.8 

700.0 

608.2 

log y 4 9.6426 

log 7.29 0.8627 

Am +0.0602 

wzq + 0.2800 

in +0.3402 

log z 9.0049 n 
log 1.85 0.2672 

An -0.0647 
nQ + 0,4200 
n + 0.3653 
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If with the values of 7, m, n thus obtained the correspondino* 
velocities bo eoiiiputotl by luoaiis of tho original ec^uation 



m 


the resulting residuals should he smaller than those derived 
from the substitution of Iq, mo, jiq, i.e., the absolute terms of 
the observation equations. The following comparison of these 
residuals shows a much better representation of the observed 
values of V, especially if the sums-of~-the squares,- [w], be 
compared. ” 

Observed — Computed V, 

Weight of Shot 1 2 4 8 16 32 64 

/Qq mo no) — 63, — 32, — 24, — 28, — 38, — 37, -f- 2 

/O, m, n) — 6, + 10, + 13, +4, — lo, — 13, + 22 


itlot only are the residuals diminished, in magni-fcude, but 
their distribution is much more nearly in agreement with the 
law of error. 

The values thus obtained for 7, m, n ought not to be con- 
sidered* the best attainable, since the corrections Am, An are 
relatively large fractions of m© and rio, and it is probable that 
the neglected terms containing Am^, An^, etc., have an appreci- 
able influence upon the solution. To secure the utmost accuracy 
these values of I, m, n should be treated as new approximations 
and another set of corrections A I, Am, An derived. This re- 
solution is recommended to the student as a valuable exercise. 
Let the student also derive from the data of § 1 the most 
probable values of 7© and c, assigning unequal weights to the 
several equations. 


§ 11 . Conditioned Ohsei;vations. There is a class of cases in 
which the application of the principle of least squares seems 
to produce absurd results. Thus if each angle of a plane tri- 
angle be measured many times in order to obtain an accurate 
set of values for the angles, the application of the principle 
that the [^pvv'] must be made a minimum will furnish as the 
most probable value of each angle the weighted mean of the 
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measures of that angle, but the sum of these -weighted means 
■will usually dififer slightly from ISO®, and since the sum of the 
angles of every plane triangle must equal 180° it ai:)pears that 
the most probable values above derived are impossible values. 
It must, however, be noted that the method of treatment above 
outlined is itself a violation of Principle A, § 1, since the knowb 
edge that the suni of the angles must equal 180° furnishes a 
relation among those angles ■which may be used and ought to 
be used in determining their most probable values j and the ap- 
parent absurdity above found is produced by nieglecting this 
part of the data. 

A relation such as the above ■which must be exactly satisfied 
by a set of observed quantities is called a rigorous condition, 
the equation by which the relation is expressed is called an 
equation of condition, and observations of such quantities are 
known as conditioned observations. The n-umber of rigorous 
conditions is, of course, -always less than the number of -un- 
known quantities, since if it were equal to the number of such 
quantities the values of the latter would be determined hy the 
conditions alone, independently of any observations. 

In order -fco develop a convenient method of^ treating rigorous 
conditions, let a?, y, z be three unknown quantities which are to 
be determihld'^oin observation, but whose values are req-uired 
to satisfy the equations of condition 

(a;, 2 /, 2 ) = 0 i/r (a?, y, ») = 0 

Let the measurements or observations for the determination of 
the unknown quantities be represented by observation equa- 
tions of the form 

' fi(^> ~ 9 -^0 “ 9 (?) “ 9 

m, n, ^nd q being the quantities directly measured, and the 
measures for the determination of a? being quite independent 
of those, for y, z, etc. In accordance with the principles of 
least squares the valxxes of the unknown quantities are to be so 
determined that [pvu] shall be made a minimum in each series 
of observations above, represented, and therefore the sum of all 
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the 'freighted ?(*[iiares of the residuals must also be a minimum.. 
Owing to tin* <*oiiditions y, z) = 0, y, z) = 0 it will not 
iu general be possible to assign to the unknown quantities 
values wliich will give to [i;w] its least possible value^ and the 
problem becomes one of conditioned or relative minima, i.e.. 
out of all the sets of- values of a;, y, z which, will exactly satisfy 
the equations of condition it is required to find that set which 
assigns to [^pvv'] its least value, consistent with those equations. 

The method of determining relative minima is follows: 
(Jordan, Cours Analyse, Vol. i.,.§"205). Multiply each equa- 
tion of condition by an undetermined constant factor, and add 
the products to the function which is to be made a minimum. 
The derivative of the new function with respect to- each xm- 
known quantity must be placed equal to 0, and .the equations 
thus formed, together with the equations of condition, will be 
just sufficient to determine the unknown quantities and the con- 
stant multipliers. Thus, in the present case, representing the 
multipliers by — 21^^ and —’2Tc^ we have for the new function 

V 3 = \jpvv~\ — 2ki<jy (x, y, z')~-2 y, z) (17) 

and ^=0 ^=0 ^ = 0 ’ (18) 

dx dy dz 

2/; y, 2)-= 0 

will determine Jci, Tc^ x, y, and z. 

It was shown in § 7 that in general for three unknown 
quantities, 

— = 2[>ao] a + 2[pa5] y -f 2[j3ao] a + 2CpanJ 

but in the case here considered those observation equations 
which contain x do not contain either y or z, and, therefore, the 
6 and c coefficients in those equations are to be considered 
zero, all the products ad, ac, are also zero, and 

d^pyv^ _ + 2\^pan'] 

cZflj 

with similar expressions for the y and z derivatives. 
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Denoting for the sake of brevity y, z') and y, z) 
by eft and ip respectively, we obtain by clilfereutiating iv 


= Lpaa2x + Zpan2 ^ = 0 

from which 

[paw] d<l, fc, d,/, kt 

[_pcLCL^ dx \_paa] dx [_paa} 

y ^ CP^^H I ^4* ^ ■ dip ^ Jc^ 

[p66J dy ^ dy [p66] 

fei . dxp Tcq 

[peej dz \_pcc] dz [peej 


. These equations determme the values of x, y, z, when Tcj 
Acjj are known,- and it should be observed that the first terms of 
the second members of the equations are the values of aj, y, z, 
which would be obtained by treating the observations as if 
these quantities -were entirely independent of each other, e.g. 
in the case of direct observations of the quantities they are 
the weighted means of the observations. If we represent the 
values thus obtained by aJo? Vot represent by Vj, Vj, the 

corrections which must be added to these . quantities in order 
to obtain the most probable values of a?, y, z, i.e. put 


05 = ajo + 'Vi 
we shall have 


y ==yo + Va 


« = »o-f % 


• 

dx 

[paa] 

H — 
dx 

\_paai 

d<f> 



Tcq 

' dy' 

[j)663 

^ dy 

Ci>66] 

d<f> 

CSSZ • ' * 

fc, 

r ..nr 

+ ^- 

r 


\ 




dz ^ dz fpcc3 J 


( 19 ) 
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The quantities and h called oorrehitcs, and from the 
manner in which they were introduced it appears that the 
mimber of correlates is equal to the number of rigorous con- 
ditions to which the observed quantities are subject. To 
determine the values of the correlates let cpo-I- 2/o + i'a? 
be substituted for a?, y, z in. the equations of condition, and 
the equations be developed by Taylor’s Formula, giving 


and a similar expression tor y, z)'. Let theu values of 
^ 2 ? ^8 in terms of and be substituted in these equations, 
and put 

and the equations become 


(2()) 


[ -j- [ <iP~\lc2 "-I- <!> (pCQ, «o) — P ) 

= 0 ) 


( 21 ) 


^ (P'03 y^3 

from which the values of hx and ^ may be obtained, and thus 
the values of Vg, from equations (19), which may now be 
put in the following mdre* convenient form: 

V [ ‘^2^1 + 

V [ ] ^8 = <=^3^1 + 


The method by which the above equations have been derived 
for the case' of three unknown, quantities connected by two 
equations of condition is perfectly general and may be ex- 
tended to any other number of quantities whose values are 
to be obtained from independent observations. In the cas^ 
which actually arise in practice the observation equations and 
equations of condition are usually of simple form, the differen- 
tial coefficients and the quantities a, c, etc., being usually 
equal to either 1 or 0. 
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ProbijE^i- Xict the student show hy the method of corre- 
lates that if the sum of the measured anjjles of a plane triangle 
exceed 180° by a quantity e, the angles must be corrected by 
distributing e among them in such a manner that the correction 
to each angle is inversely proportional to the weight of the 
angle. 

To illustrate the application of the principles of the present 
section to a numerical problem, we select from the XT. & 

&, Purvey Report for 1884, pages 409 et $eq., the following tele- 
graphic determinations of longitude, and seek to adjust them 
so that they shall be mutually consistent. Each diference of 
longitude between two stations was directly observed, so that 
the observation equations are all of the form x = Wi, co = 
etc., and the values given below are the weighted means of the 
individual observations of each series. The probable error of 
each determination (see §..12) is placed immediately after the 
quantity itself, and the weights of the determinations axe 
assumed to be inversely proportional to the squares of* the 
probable errors. 


Statlous- 




Symbol. 

Observe^ 
Difference of 
Xonsritude. 

1 

v(p 

2^ 

Cambridge, Mass., 
■VVasliington, D.C., , 


^0 

23 ”" 4 : 1 ? 041 '± 0?018 

0.18 

0.032 

Cambridge, Mass., 
Cleveland, 0-, 

!• • • 

Vo 

4:2 14.876 ± 0.038 

0.38 

.144 

Cambridge, Mass., " 
Columbus,- 0-, J 

1 • ■ 


4:7 27.713 ± 0.035 

0.35 

.122 

"Wasliington, D.C., ' 
- Columbus, 0., j 

r ■ 

iSi 

23 46.816 ± 0.038 

0.38 

.144 

Cleveland, 0., ' 

Columbus, O - , J 

1 • • 

Wq 

- 5 12.920 ±0,045 

0.45 

.202 


The five observed difierences of longitude givd rise to 
rigorous conditions represented by the following equations o 

condition. : 
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The coefficients in the observation equations being all equal to 
unity, 

—Py> etc., 


and 


“1 


cl<f> 

cLv 


“2 


d4> 






etc., 


Vi?x^ VPiT ~^P2. 

and from these expressions are derived the following v^ues of 
the coefficients, together with the siiins jti = 0-2 = oa+A? 

etc., which are to be employed as a check upon th^formation 
of the normal e'quations for determining the correlates. 


COEFPICIEN'TS. 


Sulisoripts. 

1. 


3. 

4. 

5. 

a 

+ 0.18 

0.00 

— 0.35 

+ 0.38 • 

0.00 

3 

0.00 

+ 0.38 

— 0.36 

0.00 

0.4:0 

s* 

+ 0.18 

+ 0.38 

— 0.70 

+ 0.38 

+ 0.45 
. 


IFORaiATION" OF THE COBEELATE EqUATIOXS. 


aa 

ap 

as 

PP 

Ps 

+ 0.0324 

.0000 

.1225 

.1444 

.0000 

+ 0.0000 

.0000 

.1225 

.0000 

,0000 

+ 0.0324 
' .0000 
.2450 

.1444 

.0000 

+ 0.0000 

.1444 

.1225" 

.0000 

.2025 

0.0000 

.i444 

-2450 

.0000 

1 .2025_ 

+ 0.2903 

Clieck. 

+ 0.1225 » 

+ 0.4218 
0.4218 

+ 0.4694 

+ 0.5919 

0.5919 


COBBEEATE IfTOBMAE EQUATIONS. 

-f. 0.2993 *1 4- 0.12257C2 + 0M44 = 0 = - 0*-^^ 

4 - 0.1225*1 4- 0.4694*i + 0 .091 = 0 ^ — 0 .OTS 
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The absolute terms of the correlate equations dre obtained 
y substituting the observed values rro, yoj in the equa- 

tions of condition, and the values of ku ^2 may be found from 
t re correlate equations, either by G-auss^s method of substitu- 
ion or by any of the ordinary algebraic processes of elimina- 
tion. The corrections to etc., and the adopted values 

of the unknown quantities, are now found from 

'yi = + 0.032 + 0.000 A :2 = — 0*.014 x = 23“ 41*. 027 

^2 = + 0.000 7Ci_ -{- 0.144*2 = — 0.011 y = 42 14 .864 

0.122 *1 — 0.122 *2 « -b 0.064 2 = 47 27 .777 

^4 = + 0.144 * 1 4 - 0.000 *2 ==s — 0.065 u = 23 46 .751 

^5= +0.000*14-0.202*2= -0.016 it; = 5 12.913 

The values thus obtained satis:fy the rigorous conditions of 
the problem, and are the rtiost probable values which can be 

obtained from, the data given abovei 

’ \ 

§ 12. The Probable Error. Every intelligent observer 
desires to know something of the quality of his observations, 
how good or how bad they are; the computer who ha^ to com- 
bine the results of di:fferent series of observatijbns should have 
some knowletfee-jQf their relative accuracy in order to assign 
to each series its proper weight ; and the investigator engaged 
in a complicated series of experiments desires some criterion 
by which to estimate the relative errors of the several parts 
of his work, in order to properly apportion his care among 
them, giving the maximum attention where the greatest errors 
are to be feared. It is evident from the nature of the case 
that no absolute criterion of this kind can he famished,, since 
any series, 5 f observations maly** be affected with systematte 
errors which seriously impair the accuracy of its results bat 
furniah-no indication of their presence. Both •observear and 
computer do, however, estimate the accuracy of observafeons 
by their agreement among themselves, and that within certain 
limits this procedure is correct follows from law of 

error. If we suppose a veiy long series of observatioBS 
affected only by accidental ermrs, the values of the nnlrn own 
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quantities outairucd from the series will differ but little from 
the true values (^if the series is intinitoly loni^ they will be 
the true values), the residuals which they furnish will be very 
nearly the errors of observation, and the value of A in the 
eqi^ation of the error curve will furnish a measure of tlie 
precision of the observations as well as a measure of tii® 
smallness of the residuals. On the other hand, if the student 
attempts to construct the error curve corresponding to aiqr 
short series of residuals, e.g., those of § 10, he will find 
while they give him some informatibh'iic' regard to tbo emwe 
there will be much that is arbitrary in its actual conStirtMitMm, 
and that many curves can be drawn which will to fib 

the residuals equally well, i.e. the amount of data in tbia oao* 
is insufffeient to determine more than a rough apparoyiio ntMio ^ 
to the measure of precision of the observations. If tlio olnao- 
vations are affected with systematic errors, the residiials 
be very different from the errors of the observatknui^ ami wul 


then furnish no indication of their accuracy. 

It thus appears that any conclusions in regaj pfi ^ 
curacy of a given set of observations must be ****f^^T^^ 
caution if they are based solely on the residuals fkimi^b» 
the observations. Such conclusions are, in 
within certain limits whose general nature 
but within these limits the information thus fianu^ 
of much value, and it is frequently employed or jpnxpoMw 

indicated at the beginning of this section. ^ w am 

The measure of precision, ib, 
name as the. appropriate means ^ I, 

accuracy of a set of observations, but 
used, another function of the residuals being ^ 

be arranged in the order of iwi* 

place in the series wiU hsTe » W 
L there are less than it, and m «ny ftrtw ~ 

tions of the same de^ of wM fc* 

it will be an even chance that any givea 
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greater than., or less than, the micUlle one al)ovo selected. This 
middle residual is usually denoted by r, and is rather inappro- 
priately called the probable error of the' series, the adjective 
having reference to the equal probabilities of the occurrence of 
residuals (errors) greater than, or less than, r. 

It is apparent that the greater the precision of any set of 
observation, the smaller will be the corresponding probable 
error, but the exact relation which exists between h and r 
must be derived from the equation of the error curve. The 
symmetry of this curve with respect to the axis of y shows 
that the same law of distribution holds for both positive and 
negative errors, and that in a very long series of residuals the 
probable error r will occupy the middle place among the 
positive errors and among the negative errors considered sepa- 
rately, as well as among all the errors taken without regard . 
to sign. Since we are concerned only with the numerical 
magnitude of r we may confine our attention to the positive 
residuals, and find the relation between r and h from that half 
of the error curve which lies to the right of the axis of y. 

Since the probable error is a residual, it must be represented 
-by the absciss^ of some point on the axis c/f a?, and we may 
determine t^is^_ppint from the condition that the ordinate 
draiwn through it bisects the area of that half of the curve 
under consideration, since (from the relation between areas 
and the number of residuals of a given magnitude developed 
in § 4) this is the geometrical equivalent of the statement 
that the number of residuals greater than r is equal to the 
number less than r. By interpolation from the table in § 4, 
the value of the argument corresponding to .4 = 0.25 is found 
to be hx hr = 0.477, whence tSe relation between the proba- 
ble error and the measure of precision is 


y = 


0.477 


( 22 ) 


The student will observe that in the definition of the probar- 
ble error reference is made to a very long series of observations,' 
and in a series of infinite length the value of r might be found 
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immediately fn.nu its definition, but in any ordinary set of 
observations it is bt^tter to assume that the residuals are dis- 
tributed in aeeordiinee with the law of error, and to determine 
the value of r from the relation between h and the sum of the 
squares of the residuals, § 5, which gives 

0:477 V2^M 

"We here encounter a difficulty arising fron\the attempt to apply 
to a short series of residuals principles which af'e rigorously true 
only when the series is of infinite length. Suppose the above 
expression for r applied to a series of three observations involv- 
ing three unknown quantities whose values are derived from 
the resulting observation equations. These values will exactly 
satis^ the equations, no matter what the errors of the obser- 
vations may be, and the residuals being all zero, there will be 
found r = 0 and A = 00 , which. is absurd. The observations in 
this case furnish no data from which to estimate their pre- 
cision, and in every such case where the number of observa- 
tions is equal to the number of unknown quantities, the 
expression for the probable error ought to become indeter- 
minate, It is therefore customary to put 

r = ± 0.674 \|I3 (23) 

in which /* denotes the number of quantities whose values have 
been derived from the observations. This equation, which is 
known as Bessel's expression 'for the probable error of a sin- 
gle observation, being only an approximate one, we may usually 
put ^ in place of the coefficient 0.674. Among German physi- 
cists and astronomers, it is quite customary to suppress this 
coefficient altogether, and* to use the ^^mean error" 

•* = ± (24) 

for the comparison of observations.. Geometrically considered, 

€ denotes the abscissa of the point of inflexion of the error 
curve. , 
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A simpler expression for the i->rol)able error may be obtained 
by substituting in the er[uation 

,, __ 0.477 
h 

a value of 7i derived as follows : Let each member of the equa- 
lou of the error curve be multiplied by xdx and integrated 
between the limits — oo and H-oo, giving 

I xydx=i.—— \ xe-^^^^dx 
Vor*^"** 

The value of the first integral in this equation is obviously 0, 
since as we pass along the error curve from — oo to -j-oo every 
value of y occurs once associated with a negative value of x, 
and again with a numerically equal positive value, and- for 
every negative element xydx in the integral there occurs an 
6qual positive xydx so that the entire sum is 0. If, however, 
we agree to neglect the sign of x and to consider only its 
numerical value, we shall find 

* /'4-« 

. j xydx = 2 1 xydx 

and by a course /of reasoning precisely simila/ to that applied 
/in § 5 to the xfuairtity I o^ydx, it may be shown that 2 I xydx 

• ■ 1 •» */-co */o 

is equal to the mean of all the residuals taken without regard 
to sign. We may therefore write 


where the + inside the brackets denotes that all of the resid- 
uals are to be treated as positive quantities. Putting Aa? = i 
.in the second member of this * equation and. remembering 
that here also we are concerned only with numerical values 
of X without regard to sign, we obtain 






n AVtt 


Introducing the limits into the integrated expression there 
results C+v]_ 1 



50 


THE :METH0D of liEiVST SQCTAEES. 


and ,.=,0._t77V;L+:J (2«) 

71 

TKis formula is rigorously correct only when, the number of 
observations is infinite, and it must be traiisforinecl^ so as to 
become indeterminate when the number of observation equa- 
tions is just suffi-cieiit to determine the unknown quantities, 
i.e., when n = ju. This might be accomplished by writing 
n — fj. in place of 72 , as was done in e quation (1 9), but it is 
customary to substitute in this case -^/w.(7^•— /x.), which also 
renders r indeterminate when n = /*, and gives values of .r more 
nearly in agreement with equation (23) . JMaking this substi- 
tution, we have 

r= ±0.846 ; (26) 

— fi) * 

which, when /«,=!, is known as JPeters^ formula for probable er- 
rors. This formula is very convenient for the numerical compu 
tation of probable errors, but where the number of observ^ioim 
is small the results furnished by equation (23) are considered 
more reliable, but neither formula can furnish a good determina- 
tion of probable errors from a small number of observa 
The numerical application of these formulse may ^ ^ 

trated by the following short series of sextant observations or 

the determination of latitude. 


Mean = 4S 
n=l3 


irvatioTxs 

V 

vv 

'4' 46" 

19" 

361 

4 24 

3 

9 

4 7 

20 

400 

4 28 . 

1 

1 

4 59 

32 

1024 

4 39 

12 

144 

4 52 


625 

4 52 

2& 

625 

3 47 

40 

1600 

4 15 

13 

144 

3 36 

51 

2601 

4 40 

13 

1^ 

4 *27 C+®1= 

253 

|;w3 = 7703 


I 



logC+ ©] 
a,c., log Vn(7i — 1) 
log 0.8-15 
logr 
r 

iogC®»3 
log<» — 1) 

losM 


log 


J [ggj 


logO.874 

k^r 

r 


3-403 
8.940 — 10 
9.926—10, 
1.269 
± i8«.e 

3.S86 

1.041 

3.845 

1.42S 

9.S»— 10 
1.251 
± 17*^^ - 
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lie clitTerence betweeii the values of r found from the first 

aric second powers of the residuals is small compaued with the 

unceitainty of each arising from the small number of obser- 
vations. 

^ In so far as these observations can be considered as furnish- 
ing a value of r, they indicate that in a future series of similar 
a^d equally precise observations, there should be as many 
o servations furnishing residuals (errors) greater than 18^' as 
there are observations giving residuals less than 18". The ± 
■^hich is commonly prefixed to the numerical value of t, denotes 
that the observed quantity is as apt to err in excess as in 
defect. 

Let the student derive from the residuals given in § 10 a 
determination of the probable error of an observed noting 
that in this case ^ = 3. 


§ 13- Probable Error of a Ptinction of Observed ftnantities. 
Let oj", a"' denote quantities which have been determined 
from observation, and let r', r", be their probable errors. 
Let w be a quantity whose value has been computed from the 
values of x\ a", a'" by means of the relation / 

. u=/(fl5', a" a?"') . . 

It is evident that the precision with which w is determined, 
depends upon the precision of x', a", a"', and by a slight exten- 
sion of the term “probable error” we may consider the pre- 
cision of u to be represented by a probable error, r, and may 
inquire the relation of r to r', r", r"'. 

Since a probable error is one of ihe residuals or errors of a 
very long series, we may. obtain ^the desired relation between 
Tj r^j r”, from a consideration of the general relation of 
set of errors i?", v*", in a;^, ^ to the eorrespondfog 

error, .'y,-ia u. This relation is 


■ ' V = 


das* da:** 




dtt 


V*** -J- etc. 


(see § 8). To avoid the necessity for considering the signs of 
V*, v**, v"*, let this equation be squared, giving 
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IH2 


from wliicli all terms involving the products v'v''j 
etc.j have been dropj)ecL for the reason that the probable error 
of xo depends upon the average magnitude of v, and in the long 
run. any pair of residuals 'u', will have opposite signs as 
often as they have like signs, and will therefore produce an 
equal number of positive and negative terms whose effect upon 
tlie mean value of will be very sm^ compared* with the 
terms containing which are always positive. Re- 

placing these actual errors by the corresponding probable 
errors, we obtain 

and. an equation of similar form will express the relation of 
tlie .probable error of the function to the probable errors of the 
quantities upon which it depends, whatever the number of 
tlaese quantities may be. 

VV 6 proceed to apply this relation to a few simple cases o 
frequent occurrence in practice.- 

(a) The probable error of the sum of n observed quantities. 


In this case u = a?' + + as"' -f- ... + aj* 

and each of the differential coejBS-cients etc., equals 1, 

whence ^ = + + **• (^) 

Xhe probable error of the meam of w ol^rved quantities. 

In. this case u = -(a/ + ’•* 

n 


(ia/ ** 


= i (r' * + r'" -h * 4- • * • 
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We have here to distinguish two cases. If the x^s are all of 
equal precision, the r’s are equal, and may be represented by a 
common symbol )\ j whence 



(29) 


If the observations are of unequal precision represented by 

ft ^ . 


weights 


P'l P"i 


we have 




M 

^ = ^ — ^ etc 

c?a!' O] da:" [p] 






r =s ± 




V[>] 


(30) 


where rj denotes the probable error of an observation whose 
weight is 1. * ■ . / 

The relations here derived between the probable error of a 
single deternAnaiTon of a quantity, and the probable error of 
the mean of n determinations, may be employed in connection 
with equation (23), to determine the probable error of an 
adopted value based upon several determinations of a quantity. 
Thus, in the general case of observations of unequal weighty if 
Vi represent the probable error of an observation of weight 1, 

. and r the probable error of the weighted mean, we have from 
equation (23), 

ri = 0.674 


VO] 

Let the student show that urhen the observations are of 
equal precision and. 
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i, == ff 

It = sm — 
a 



r 

r 

r 


= ± ft's/ Hr* 

x' T* 

= ± cos— 

a a 


= ±0.434-{A + ^JV 


§ 14. Assignmeat of Weights. Rejection of Observ’ations. 
The term weight has been employed in the preceding sections 
as a measure of the quality of an observation, but its use is by 
no -means limited to the case of single observations. Thus, 
from an Jnvestigation of t7ie Distance of the etc., by S. 

2^e%ccomh, we select the following determinations of the solar 
-parallax. 


]MCetnodL by wlxiclx determinod. 

Parallaat:. 

TVeiiTbt. 

Meridian observations of Mars, 1862. 

8^'.855 

25 

Micrometric observations of Mars, 1862. 

8.842 

6 . 

Parallactic inequality of the Moon. 

8.838 

IS 

I/Unar equation of tbe Earth. 

8.809 

3 

Transit of Yenus, 1769. 

8.860 

6 


Each value of the parallax here given is the final result of 
an elaborate discussion of many observations, and the weights 
indicate the relative excelleiice attributed to these results by 
the author of the investigation. If tt denote any one of these 
values of the parallax, p its weight, and x© the most probable 
value of the parallax, we shall have 

..r,=.^d = 8".847 (§6) 

[>3 

It is to be noted that this value depends upon the weights 
assigned to the individual determinations, and that by prop- 
erly selecting the weights, tt© may be made to assume any value 
whatever between the least and the greatest single determina- 
tion. Thus if the weight 100 be assigned to the value 8" 800, 
and to each of the other values the weight 1, we shall find 
TTo = 8'*. 859, while a weight 100 for the value 8''.809 with a 
weight 1 for each of the others, makes «(, = S'^811. Between 
these limits the value of depends upon the judgment of the 
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computer in. assigning "weights, and this determination o£ 
weights is one of the most delicate questions that arise in the 
.application of the method of least squares. 

A relation between weights and probable errors may easily 
be established, which is frequently of service in that it enables 
the problem of weights to be stated in a different form. Let x 
denote an obseiwation whose probable error is r and whose 
weight is 1, and denote by x', x'^j x''\ etc., observations or com- 
binations of observations of the same quantity, whose weights 
and probable errors are represented by 

etc. In accordance with the definition of weights, x^ is the 
equivalent of p' observations of the same quality as x^ and 
from the equations derived in the preceding section we have 




with a similar expression for each of the other quantities ; 
whence 

p V* — 2 _ piiiynt2 ^ 


and 


„f._ d"— — — 


.(82) 


and, in general, tlie weights axe inversely proportional to the 

squares of the probable errors. ^ 

It has been sufficiently shown ..that probable errors derived 
from the residuals furnished by a series of observations repre- 
sent only the effects of accidental* errors of observation, ut we 
may extend the significance of the term so as to include ^ 
estimate of the effect upon a,", xf’> of systematio^rs ^ 
the observations. Let n and '.wt represent those parts of the 
probable error which come from these, two sources respeetivoly, 
a-n 'd from § 13 we find for their combined effect 

and the expression for the weights becomes 

*■ 4 - T ' 
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By this '-levioo the tie tor mi nation of weights is retliicetl to an 
estimate ot' the euinbined eifect of muhtloLitiil and systematic 


errors of observntiou. upon the quantity whose weight is 
desired, and it was from an estimate of this character that the 
weights of the jiJxrallaxes given above were derived. 

If ?•' denote the probable accidental error of a single obser- 
vation, and the quantity whose weight is p is the mean of n 
such observations, we shall have 


p = 



( 34 ) 


from which all constant factors have been dropped, since 
only relative values of p are ever required. It appears from 
this equation that if the systematic errors, 7*j>, axe very small 
compared with the accidental errors, r', the weight increases 
rapidly as the number of observations is increased, but if the 
systematic errors are large, the weight is but little affected by 
the number of observations j a relation to be considered in 
deciding how many observations sh^ll be made to determine 
an unknown quantity. 

In some cases it may be impossible to form any reliable 
estimate of the effect of ‘systematic errors, and results which 
have been derived by different methods, or under different cir- 
cumstances, may then be given equal weights on the supposition 
that they are affected by different systematic errors which it 
is equally important to eliminate j but this is equivalent to 
putting r2 = co. in the equation for the weights, and it will 
rarely happen that this is the best estimate which can be made 
for the amount of the systematic errors. 

It frequently happeiis^that in a series of otherwise accordant" 
observations, one or two will be found which differ widely fpom , 
the others, and which if included in the final result will 
furnish large residuals. What shall be done with olservainons 
of this kind has long been a vexed question. To reject them 
•is equivalent to assigning to them the weight 0, and is the 
expression of the computer’s judgment that they can contribute 
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notliing- to the accuracy of the result which he seeks to obtain. 
Xu an infinitely lonjj series of observations, errors of any finite 
magnitude may be permitted without iinx)airing the accuracy 
of the final result, and the existence of such errors seems con- 
templated by the theory which we have adopted, since the 
eq^uation of the error curve gives finite values of y for all 
values of a? between, the limits — co and -h co . But in the 
actual case which arises in practice where a result must be 
obtained from a comparatively small number of observations, 
a single one of these, if affected with a large error, may make 
the final result farther from the truth than any one of the 
other observations. On the other hand, cases are by no means 
unknown in which a single discordant observation in a series 
proves to be nearest to the true value of the quantity soughf^ 
the others having all been vitiated by- some common cause j 
and between these extremes an infinite variety of cases may 
be found. It must in general remain a matter of doubt 
whether a given discordant observation shotdd or should not 
be rejected, and the decision made by the computer must' be 
his judgment based upon all the data available as to whether 
more will be gained by rejecting than by retaining it. A knowl- 
edge of the way in which observations are made, of the circum- 
stances attending^the particular observation in question, the 
magnitude of the errors which may reasonably be expected 
with the given observer and apparatus, or instrument, are 
elements which should be included in this judgment 5 and the 
observer will greatly facilitate its formation by making copious 
notes at the time of observation of all circumstances which in 
his opinion may affect the quality of his work, and particularly 
by noting any abnormal circumstances sheeting a single obser- 
vation or a part of the observations. 

A doubtful observation should be rejected if it is the com- 
puter's deliberate, judgi^ent that its retention will hurt more 
than it wiU help his final result, but it is never legitimate for 
the computer to suppress" an observation. A rejected observa- 
tion should be included in the statement of his data, and may ^ 
properly be accompanied by an explanation of the reasojm for 
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• * 

^ejootioTi, ill order that any person into rested in tlie result 

jndgmeut of the data and the manner in 
1 C 1 they have been, discussed, and may, if necessary, re- 
1SCU5S the observations in accordance with that judgment, 
e conclusion of the whole matter of assigning weights to 
nunxerical data may be siimmed up in the statement that no 
ma ematical expression will suffice for this purpose, but the 
weig ts must be determined by an exercise of personal judg- 
tn^t, and the wider the knowledge upon which this judgment 
IS ased, the greater confidence will the weights and the result- ' 
values of tfie unknown q^nantities command. 

§ 15, Empirical or Interpolation FormuIsB. In the preceding 
sections attention has been directed to that class of problems 
in which the theoretical relation between the observed quanti- 
les and those whose values are to be determined is known j 
at is, an equation of known form exists between them, and 
the^ problem has been to determine the values of the constants 
which appear in the equation. But a very different class of 
cases now demands. a passing notice. 

-A. series of observations is sometimes found to be affected 
with, errors too great to be explained as the result of unavoid- 
able and fortuitous causes^ and it becomes apparent that the 
law of recurrence of these errors must be determined before 
the observations can be made to yield any valuable results. 
The American parties which were sent out in 1874 to observe 
the transit of Tenus were provided with instruments for the 
determination of their local time, of such a character that the 
accidental error of a determination from a single star might 
fairly he estimated at 0*.05 or 0*.06, but results obtained fcom 
observations of different stars varied among themselves by 
more than, ten times this amount. An inspection of the disH 
crepancies having shown that they depended in some way upon 
the distance of the observed star from the zenith, it was found 
by trial that the error at any zenith distance;, z, could he repre- 
sented by the expression 

jE?=s ± facosz — dsia2s!| 
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■^^ere a and 5 are constants whose values were found from the 
observations themselves. The physical cause of these errors 
was sixbsequently found to be the bending- of the instrument 
under its own weight, but it is to be noted that the above law 

o recurrence of the’ errors was determined first, the ca’use 
afterwards^ 

Expressions of this kind are sometimes called interpolation 
fo'i midce and sometimes empirical equations; the one term hav- 
ing reference to their use, the other to their derivation. They 
are of very general use in all branches of physical science, 
since they may be made to serve as a convenient summary of a 
vast amount of numerical data, ^d one of the most important 
applications of the method of least squares is in determining 
the values of the constants which enter into such expressions. 
The problems treated in §§ 1 and 10 both belong to this class, 
and the following expression for the magnetic declination at 
Washington,. D.C., derived by Mr. O. A. Schott* from a series 
of observations extending over ninety years may serve as a 
. .further illustration : ' • 

Mag. DDec. = 2^47 -h 2^50 sin [r.40 (T-1850) — 14®, 6] 

where T denotes the year for which the declination is required. 

/ When the^caiiae whose effects are to be represented by an 
equation' is known, the form of this equation can usually be 
derived by mathematical analysis j but where empirical formulse 
are employed other methods must he resorted to. The simplest 
of these is a graphical representation of the errors or other 
data under consideration. For this purpose let the errors 
represent ordinates, and the values of any variable upon which 
they are supposed to depend, the corresponding abscissas. Ijet 
poiats.be plotted with these^i.dsrdinates and abscissas as was 
done in obtaining -the form of the error curve, Figs. A, B, G, D, 
and let a smooth curve be drawn through these points either 
free-haiid or by the aid of a draughtsman's irregular curve." 
The distance of each plotted point from, the curve, measured 
along an ordinate, is the residual corresponding to the point, 


* XT. S. C. S. Report, 1882 , p. 258 . 
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in accoiM.uinee witli tho prhicLpIe of least srpiaros the curve- 
sUoultl be .'30 dra^.vii a.s to make the sum of the srpiares of these 
residuals as small as possible, without unduly complicating the 
curve. If the variable has been properly chosen it will in 
many cases be found -possible to draw a simple curve which, 
shall represent the data within the limits of the accidental 
errors of observation, and as this curve is the graphical rep- 
resentation of the law required, its equation, is the 

analytical representation of that law. In this manner the 
form of the equation treated in § 10 was obtained* 

In other cases it will be possible to draw a smooth and sim- 
ple curve wliich shall not represent the data within the limits 
of accidental error of the observations, but about which the 
points will be grouped, alternating from one side to the other 
in a systematic manner. Let the excess of the ordinate of any 
point over the corresponding ordinate of the curve be plotted 
with the given abscissa in a new curve. The two curves thus 
constructed will together form the graphical representation of 
the law of the data, and the analytic expression of that law 


will be 




if y and y = are the equations of the two curves 

respectively. 

In some cases the curves themselves will be a sufficient 
representation of the data, and it will be imnecessary to deter- 
mine their equations since the value of y corresponding to any 
given value of a? may he obtained by direct measurement. In 
other cases the curve will be chiefly serviceable in suggesting 
the probable form of an equation between the observed qu^- 
tity and a variable upon which it is supposed to depend, ot m 
showing that no simple relation exists be^een them- 
, forms of equation are of such frequent use in this oonnec on , 

that they deserve especial notice. 

If the plotted curve does not differ very greatly feom a 
straight line, the relation of the variable a? to its tamticHa y 
may he represented by 

y sss ct -f- hx -|- ca? -f- da^ -h etc. 


idS) 
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This eq^iiatiou contains the first few terms of an infinite 
senes by which a limited arc of any continuous curve can be 
represented, and since the actual relation between y and x 
could be represented by a curve, if its mathematical expression 
were known, it follows that the above equation can be made to 
represent this relation over a certain range of values of a?, by 
assigning proper values to the coefficients. The number of 
terras of this series which should be taken into account, and 
the limits of x beyond which the equation is not applicable, 
depend upon the .actual relation between y and ar, and are 
therefore unknown ; but, in general, it is not well to attempt to 
use this equation for large values of aj,. or when more than 
three or at most four terms are required. Its application in a 
simple case is illustrated in the problem of § 1, where y and x 
being replaced by the length of the bar and its temperature, it 
IS assumed that their relation can be expressed within the 
range of temperature over which the observations extend, by 
the first two terms of the series. 

The second type of equation above referred .to is 

y = c*o -f* Oh cos-~ -f* % cos — - -f- 0^3 cos — -jr etc. 

m m ' m ' 

'4- S]^in — 4* ^3 sin — -f- bs sin— -f- etc. 

^ m m m 

in which m is an undetermined constant expressed in the same 

unit as a? ; — is therefore a ratio, or absolute number, which 
m 

in the application of the equation to numerical data must 
be transformed into circular measure by multiplying it by 

= 57^.29578. This form ot equation may be made to 

TT • 

represent any relation whatever between finite valuea of y and 
X, including those cases in which ^ is a discontinuous function, 
but it is especially advantageous when y is a periodic function 
of cc, i.e., one in which the same values of y recur for values of 
X, separated by a constant interval, r, called the period of x, 
so that „ ' 




62 


THE ^[ETirOD OF LEAST SQUAKES. 


/(./•) =/(.>; -f r) -f 2r) = ••• =/(^ + nr). 

The simplest type of such a function is ?/ =; sin a;, the period 
in this case being r = 300° j e.g., 

sin 10° = sill (10° + 3G0°) = sin (10° + 720°) = etc. 

"When y is such a function the constant should be put equal 

to the period divided by 2 ' 7 r, m = in other cases the value 

of m should be so taken that the greatest value of — ‘ included 

m 

among the data lhall not exceed -n-r The application of this 
formula may often be facilitated by noting that if the relation 
between y and a? is such that /(+ a*) =/(— a?), the sine terms 
all disappear, and the equation reduces to 

y = Go + Oi cos— + tta cos— + etc. (37) 

m m 

while if /(a?) = — /( — a?), the cosine terms vanish, and the 
equation beconies . 

y = -f- Sj^sin— + 62 sin— -t-etc. (38) 

m m 

The several forms above -given to this type of equation are 
those most convenient for use when the values of the coeffi- 
cients a, b, etc., are to be determined, but after their nunierical 
values have been found it is advantageous to transform the 
equation as follows ; Introduce the auxiliary quantities 

no, Wi, Wa, etc. 

defined by the relations 

Wo = ao % cos^i^i = ai n* cos 1^2 = ch 

Wi sin = 6 i Wj sin 

and the equation becomes 

y =— Wo 4- ni cos — jyi\ -f- Wgcos -h etc. 

' \m . / J 

each pair of terms of the onginal equation being here replaced 
by a single term. The expression for the magnetic declination 
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empirical or interpolation eormcjl*e, 

Rt ^asMiigtou g'ivea on 50 is of this type, as may be seen 
y ^Miting it in the e(iui valent form , 

Mag. Dec. = 2°.47 + 2“.o0 cos [1°.40(T-1850) - 104°.ej 

y>-0<ie of applying this form of equation may he illus- 
of ^ ^eans of the following data selected from the series 
Observations whose residuals are plotted in Tig. C. The 
quantity, JS,. is the difference of stellar magnitude ‘ 
\ tness) between the planet Saturn and his satellite lape- 
quantity I given with each observed JB Gxes the 
position of the satellite in its orbit at the time of observation, 
3-^ is analogous to the variable angle ^ in a system of polar 

coordinates. 


z. 

n. 

Sesidual. 

Z. 

JB, 

Residual. 

10° 

m 

m 


m 


10.82 

— 0.28 

200° 

10.66 

+ 0.28 

70 

11.81 

+ 0.22 

230 

9.87 

— 0.21 

110 

11.69 

, —0.02 

270 

10.43 

+ 0.21 

140 

11.42 

— 0.03 

310 

10.48 

— 0.15 


-S is here seen to run through a complete cycle of values 
between the limits 9.87 and 11.81, while I varies from 0*^ to 
360 . "We shall therefore endeavor to represent 3 as a periodic 
function' of ^i^whese period is 360°. In accordance with thift 
assumption we pjit 


X 

m 


I 

360° •* 
"Ztc 


180° 

TV 


= ?; y:=B 


and taking into account the firs^ five terms of the series, the 
several observations furnish the following 

at 

Obsebvatioh" Equations. 


10,82 = Oo 4- 0.98 + 0.17 + 0-940* 4- 0.34^ 

11.81 = ao -h 0.34 Oi -f 0,94 6i ~ 0.770* 4- 0.645, 
11169 = Oo ~ O.S4ai 4- 0.94 - 0.770* — 0.646, 

11.42 = Oo — 0.77 oj 4- 0.64 6, 4- 0.170* — 0.986, 
16.66 = o;, — 0.940, — 0-34 6, 4- 0.77 o* 4- 0.646^ 
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the 3rETHOD OP LEAST SQUARES. 


O.iS* =2 — 0.(3-i- (*i — O.T t bi — 0.1, / cin -f- 0.08 ^2 

10.43 == c\ -f- 0.00 «i — 1.00 — 1.00 a., -f- 0.00 bs 

10.48 = H- 0.64^1 — 0.77 b^ — 0.17 a>2 — O.OSds 

The solution of these equations will be found in the follow- 
ing section. The values obtained for the constants are 

Co =10.92, ai = -f.0.1o, 5i = -f-0.74, ^2 = — 0.04, Z>2=“0.1S 

Introducing the constants N'j and determining their values 
from the relations 

tiq = 10.92 COS JTx = -h 0.15 «2 cos = — 0.04 

Ux sin.^ = 4- 0.74 ng sin.^ = — 0.18 

the equation becomes 

B = 10.92 -h 0.76 cos (I — 78^5) — 0.18 cos (2 1 — 77°.5) 

The residuals obtained by comparing the values of JB computed 
from this formula with the observed values are given above 
with the data. 

Abundant data for exercise in deriving empirical formulae of 
this hind may be found in the United States Coast and G^&>detic 
jSurvey Beport for 1882, pp. 218—257. 

§ IS. Approximate Solutioiis. It is ofteui desirable to obtain 
from a sei‘ies of observations, as rapidly and with as iitj;le labor 
as possible, a set of values of the unknown quantities involved 
which shall be fair approximations to their most probable val- 
ues, but in which the highest degree of accuracy is not required. 
In cases of tbia kind the least square treatment of the obser- 
vation equations as illustrated in § 10 is too long and laborious, 
the following method may often be substituted for it with 
advantage. 

luet there be any number, e.g.,. three, unknown quantities in- 
-yolved in a set of observation equations of the form 

aae-^by-^ cz + n^O Weight =p 

and let each of these equations be multiplied by its weight 
gt V T Ti g the group of equations, 



APPROXIMATE SOLUTIONS, 


66 


ctiix 4- bii/ + CiZ 4- = 0 A’l 1 

+ b.j^ -1- CjZ 4- ?io = 0 Afl > (40) 

etc. etc. etc. etc. J 

Miiltiply each, of these equations by the undetermined con- 
stant k placed opposite it, and let the sum of all the result- 
ing equations he formed. By the use of the summation 
symbol, this sum may be written 

[Aa]a5 + [Ar6]y 4 [kG]z 4[A?i] = 0 


several values of k which enter into this equation 
are entirely arbitrary it would be permissible to assign to them 
such. Values that [A;6] and [Ac] should each equal 0, which 
would give at once 



(41) 


This, however, is not practically advantageous on account, of 
the labor involved in determining the values of A. We there- 
fore put 


_ [ArQ im Uco] 

~ [Aa] [Aa]^ [Aa] 


(42) 


and, limiting the values of A to -f- 1 and — 1, assign them in 
such a manner that [Aa] shall be made as great, and [A&], 
[Ac] as small as possible. In this manner the coefficients of 
y and z may often be made so small that if approximate values 
of y and z are substituted in equation (42), they will furnish 
a sufiS-cieutly accurate value of £c; since the effect of the errors 
of these approximations will be much dimimshed by the smal l 
■ eoefdcients by which they are multiplied 

The value of y may be found in the same manner by sdect- 
iug a set of As which shall make [A&] laige and [Aa], [Ac] 
small, and similarly for z. Two or three trials may be required 
before sufficiently close approximations to the values of. z, z 
are. obtained, but these trials are rapidly and easily made, ^d, 
if necessary, in exceptional cases the summation equations 
may be written in the form 
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[.Va ].c+C^-'& j/Z+Cit'c + ] = 0 

[*■' c( ] ,B + [/.•" 6 ]i/ + [A-''o ]* + [A'" » 3 = 0 
[*"' a] B + [A"' 6]2/ + [A”'c ]2 + [*"' n] = 0 



(43) 


and the eq^uations solved by any of the methods of elementary 
algebra, bat in every case the values of A;, +1 and — 1, should 
be so chosen that in the first equation the coefficient of a?, in 
the second equation the coefficient of y, and in the third equa- 
tion the coefficient of z, shall be made as large, and all of the 
other coefficients as small as possible. 

3y this mode 'of solution each observation ■with its proper 
weight is included in the determination of the unJbaowns, but 
since the principle of least squares has not been taken into 
account, it cannot be expected that the resulting values will 
be the best that the observations can be made to yield. 

To illustrate the mode of Solution we recur to the observa- 
tion equations contained in the preceding section and put'ting 
Oq = 10.00 -f- a write them as follows, placing the several val- 
ues of k at the right of each equation. 


cos 3 sinZ COS2Z sin2Z k^ 


0.82=a-f-0.98 Oi-|-0.17 5i+0 94a2"f“^'^^2 
1.81=a4-0.34 ai-h0,94 5i— 0.77 OgH-O.Gl &2 
1.69=a — 0.34ax+0.945i — 0.77 af2—0.64:ba 
1.42=a— 0.7Tai+0.645i-i-0.17a2— 0.98&2 

0.66=a— 0.9401—0.34 &i-f-0.77 03+0.64 &3 

—0.13=a— 0.64 ai— 0.77 61—0.17 02+0.9852 
0.43==a+0.00ai— 1.00 61— 1!00 a>+0.00 62 
0.48=a+0.64oi— 0.7761—0.1703—0.9862 


+1+1+1+1+1 
+ 1 + 1 + 1 — 1+1 
q_l_l + l— 1— 1 
+ 1 — 1+1 + 1 — 1 
+1_1— 1+1+1 
+ 1 - 1 - 1 — 1+1 
+ 1+1— 1—1— T 
+ 1 + 1 - 1 + 1-1 


The summation equations obtained from this group are 
+ r.l 8 = 8a — 0.73 Oi — 0.19 61 — 1-00 02 + 0.00 63 
■ — 0.10 = Oa + 4-65oi — 1.136i — 1-00 o, + 0.00 62 


_l_ 4.30 = Oa + 1.15 cBj + 5.o7 61 + 0.14 + 0.00 63 

■— 0.42 == 0 a + 0-53 — 0.41 6^ + 4.42 ctf^ + 0.00 


> 


f — 0.86 = 0 a + 0,2101 + 0 . 1963 + 2 . 5402 + 5 . 20 ^ J 


(44) 
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J 




and these correspond to the normal ecpiations of a least square 
solution. To apply the method of approximations to le so u- 
tion of this group of equations, we write them m the form : 

a s= + 0.897 + 0.091 «! + 0.024 &i + 0.125 ^ 

ai = — 0.0fi2 + 0.000 ffli 4-0.243 61 + 021 jc... 

61 = + 0.772 - 0.206 ai + 0.000 61 - 0.025 Uj 
Oj = - 0.095 - 0.120 Oi + 0.093 61 + 0.000 ctj 
. 62 = - 0.166 - 0.040 Oi - 0.036 61 - 0.488 eta 

The divisions required in making this transformation were 

made by the use of (JreMe’s Rechentafdn. _ 

By. operations which can be performed mentally, we ob am 

the following sets of approximations to the values of 


<h 
*2 


I. 

0.0 

+ 0.8 

- 0.1 


IL 

+ 0.2 
+0.7 
- 0.0 


in. 
+0.15 
+0.74 
-0.04 


and substituting in equations (45) the values given under in. 

we find the adopted values / ■ 

a,Cl0 + a = 10.92 Oi = + 0-15 J = " J- 

\ " &i = +0.74 62 = -0.18 

which were employed iuxthe preceding section. 
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y.= 

• # • 


• • • • 

V^TT 




1 ' [di;] 


« 

• ♦ • • 

2k^~ n 




_ [pm] 

^0— r.^-i • * 

• • • 

♦ ••••• 

1 

• • * • 


X + [a&] y -h [cw] z -f [cm] = 0 

C«&] = i { [(^^ + - (M + W) i 

j ^ as ] = [aa] 4 - [<'^^] + + • * • + [<^^] 

C6c.l] = [6c]-g|[ac]. . . . 

Ccn.2] = Cc«.13-g|^CW:i] . 
[oa] + [a|8] fta <^ (% 2^0, *») = <>• 


y — 9-ffI = ±0.674 
h 

r ri - 

’’“"Vn VM 

t 1 1 § 1 1 

y .yi. _pM . 


,EI= ± 0. 

\n — fi 


V»i 


fx 

TiT^ t /vk nr\Ci /'i£_2?i\+etc. . 


y = n, + % cos - JTij + %cos - jVsj 

[&«] ^ 

. ^ CC = -=r= — T -r r7..«T ^ ^ 


w . 


ifc — db 1 


« * 


§ 3,4 

. § 5 

: § 6 
. § r 
. § 9 
. § 9 
. § 9 

. § 9 
. §11 

. §12 
. §13 

, §14 

. §15 
. §15 

. §16 
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The Method of Least Squares, 

"With Numerical Examples of its Application. By GEoaoB C. Co»^ 
STOCK, Professor of Astronomy in the University of Wisconsin, and 
Birector of the Washburn Observatory.. 8vo. Cloth, viii -1- 68 pages. 
Mailing price, $1.05; for introduction, $1.00. 

College Requirements in Algebra: a Final Review. 

By George Parsons Tibbets, A.M., Instructor in Mathematics, 
Williston Seminary, 12nio. Cloth. 46 pages. By mail, 55 cents ; to 
teachers and for introduction, 50 cents. 

Peirce's Elements of Logarithms. 

With an explanation of the author’s Three and Four Place Tables. By 
Professor Jambs Mints Peirce, of Harvard University. 12mo. Cloth, 
so pages. Mailing price, 55 cents : for introduction, 50 cents. 


Mathematical Tables 'Chiefly to Four Figures. 

With full explanations. By Professor James Mii<ls Peirces, of Har- 
vard University- 12mo. Cloth. Mailing price, 46 cents; for intro- 
duction,,.40-ceuts. 


Elements of the Differentiaf Gaicuius. 

With numerous Examples and Applications- Designed for Use as a 
College Text-Book. By W, E. Bterlt, Professor df Mathematics, 
Harvard University. 8vo. 273 pages. Mailing price, $2,16 ; for intro- 
duction, $2.(^ 

fJ^HE peculiarities •e-t this treatise are the rigorous use of the 
Doctrine of Limits, as a foundation of the subject, and as 
preliminary to’ the adoption of the more direct and practically 
convenient infinitesimaL notation and nomenclature ; the early 
introduction of a fev? simple formulas and methods for integrat- 
ing ; a rather elaborate treatment of the use of infinitesimals in. 
pure geometry ; and the attempt to excite and heep up the interest 
of the student by bringing in throughout the whole book, and not 
merely at the end, numerous applications to practical problems ii 
geometry and mechanics. 

E. H. Moore, PrifessoroJ ifathe- j ablest text-boot on the calculus ye 
mdtics, University (tf Chiaago: The I written by an ALmerican. 
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Elements of ' the Inte<jKCLl Oalculus. 

■■iw PI wi I ■ ■ I I I X tr -n 



mills work contains, iu addition to the subjects usually treated 

I _ M ^ AH in it ^ 1 


■*■ in a text-book on the Integral Calculus, an introduction to 
Elliptic Integrals and Elliptic Eunotions ; the Elements of the 
Theory of Functions ; a Key to the Solution of Differential Equa. 

tions ; and a Table of Integrals. 

John E. Clark, Pro/, of Mathe,- 


maiics, Shejffield Bcientijic School of 
IctZe University .* Tlie additions to 
the present edition, seem to me most 
judicious and to greatly enhance its 


value for tlie purposes of university 
instruction, for which in several im- 
portant respects it seems to me better 
adapted than any other American 
text-book on the subject. ' * 


1 J - 4 Wl# ^ m ^ ^ ■ ! m 

Spherical. Cylindrical, end 

^</^s to Ph*D ^JProfessor of Mathematics, Hair^ 

introduction, $3.00. ^ .i.* x * .s-p 

rpHIS book is intended as an introduction to the treatment of 
some of the important Linear Partial Differential Equations 
which lie at the foundation of modern theories in physic, and 
deals mainly with the methods of building up so u i 
differential equation from easily obtained particular solutions, m 

such a manner as to Satisfy given initial conditions. 

to suy that although it seemed to b< 
■written expressly, for me, one of mj 
friends who is a great mathemati 


John. Perry, Technical College, 
Jf'insbw'y, Ijondon, Kngland^ Byer- 
ly’s book is one of the most useful 
books in existence. I have read it 
with great delight and I am happy 


cian, seems as delighted with it as . 
am myself. 


A Short Table of Integrals. Revised and Bntarge^. 


-o_ r ' O PfiliRCE Prof. Math., Harvard XJniv. 32 pages, 
pnee,* 15 c^ts. Boun<? a\so with Byerly^s Calculus. 


Mailir 


Byerly's Syllctbi. 


Each. 8 or 12 P*«“» 

ta Glomctty, the Theory of Equations. 
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Ofrectionaf Oa/cu/us. 

By E. W. Htde, Professor of Mathematics in the University of Cincin- 
nati, Svo. Cloth. yJi + 3^7 pages, with blank leaves for notes. Price 
by mail, 32.15; for introduction, 32.00. 

^HIS work follows, in the main, the methods of Grassmann’s 
Ausdehnungslehret but deals only with space of two and three 
dimensions. The first two chapters which give the theory and 
fundamental ideas and processes of his method, will enable students 
to master the remaining chapters, containing applications to Plane 
and Solid Geometry and Mechanics ; or to read Grassmann’s original 
works. A very elementary knowledge of Trigonometry, the Differ- 
ential Calculus and Determinants, will be sufficient as a preparation 
for reading this book. 


Daniel Carhart, Pro/, of Mathe- 
matics, Western University of F^n- 
sylvania: I am pleased to note the 
success which has attended Professor 


Hyde’s efforts to bring into more 
popular form, a branch of mathemat- 
ics which is at once so abbreviated in 
form and so comprehensive in results. 


Elemerjts of the Differential and Integral Caleulus. 


"With Examples and Applications By J. M. Tatuor, Professor *ol 
Mathematics in Colgate University, 8vo. Cloth- 249 pages. Mailing 
price, 31.95; for introduction, 31*8t>. 


^HE. aim of this- treatise is to present simply amd concisely the 
fundamental problems of the Calculus, their solution, and more 


common appliftatious. | 

Many theorems are proved both by the method of rates and that 
of limits, and thus each is made to throw light upon the other. 
The chapter on differentiation is followed by one on direct integrar 
tion and its more important applications. Throughout the work 
there are numerous practical problems in Geometry and Mechanics, 
T^hich serve to exhibit the power and use of the science, and to 
excite and keep alive the interest of the student. In February, 189 1, 
Tayior’s Calculus was found to be use in about ^ixty colleges. 


The Katioii, New York : In the 
first place, it is evidently a most 
carefully written book, . . . 'W’e are 
acquainted with no text-book of the 
Calcnlos which compresses so much 
matter into sojfew pages, and at the 
same time leaves the impression tliax 


all that is necessary has been said. 
In the second place, the number of 
carefully selected examples, both 
those worked out in, full in illnstr 
tiou of the text, and of those left 
the student to work out for 
is extraordinary. 
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